ea 


BIFSLICE 
MICROPROCESSOR 
DESION 


JOHN MACK AND JIMBRES! 


By John Mick, Engineering 
Manager for Systems and 
Applications of Digital Bipolar 
Products, and Jim Brick, 
Manager, Microprocessor 
Support, Advanced Micro 
Devices, Inc. * 


Applications requiring more than eight 
bits of precision, substantial amounts of 
arithmetic processing, adherence to a 
predefined instruction set, or blazing 
speed need something special. More 
than a fixed-instruction-set MOS mi- 
croprocessor has to offer. The keys are 
microprogramming capability and bi- 
polar LSI. 


Today the 2900 family of bit-slice mi- 
croprocessor components dominates 
microprogrammable bipolar LSI. But 
until now the critical information has 
been buried in a mass of theory and 
application notes. 


Here, under one cover, all the theory is 
pared down to essentials and presented 
in a coherent manner. Application ex- 
amples are used liberally to illustrate 
important points. And the authors énd 
up designing not one, but two complete 


16-bit microprogrammable bit-slice 
microprocessors. 
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New integrated circults are usually accom- 
panied by a wealth of theory and data 
sheets. Shortly thereafter follow the applica- 
cation notes. The introduction of micropro- 
grammable LSI parts, such as the Am2901 
and subsequent ICs in the family, adhered to 
this pattern. We thought this was adequate 
in light of the previously successful intro- 
duction of fixed-instruction-set MOS mlcro- 
processors, which were more complex. 

However, bit-slice microprocessar design 
proved more formidable than first realized. 
One reason was the intimate relationship 
between parts. These designs required the 
designer to pick and.choose parts: How 
many slices are needed to do the job? 
Which microprogram sequencer and/or con- 
troller to select? Is a carry lookahead gen- 
erator needed? And on, and on and. ... All 
these devices had to play together; no single 
device was complete by itself. 

For this added up-front design effort, the 
user got blazing speed and the utmost 
flexibility. The latter proved fhe second hin- 
derance to easy designing. Users now had 
to design the instruction set as well as the 
hardware and applications programs. They 
no longer had the luxury of a fixed-instruction 
set. On the’ other hand, they could eliminate 
unnecessary instructions, easily modify or 
add instructions at a later date or emulate 
the existing instruction set of a slower CPU. 

Complicating matters was the fact that the 
2900 family did not spring whole into the 
world. Parts were introduced and redesigned 
over a period of years as engineering and 
processing resources could be brought to 


bear. This evolutionary process stil! goes on. 

To alleviate matters, Advanced Micro 
Devices announced a nine-part course in 
microprogrammable microprocessing, each 
part to standalone but tobuild logically upon 
the preceding part. And, because engineer- 
ing talent is our most important resource, 
this course was to unfold over a 22-month 
period. 

Since completion of the course, there has 
been no diminishing in demand for informa- 
tion on the material covered. In fact, the 
market for bipolar microprogrammable LSI 
parts doubled in each of the previous two 
years and showed no signs of slowing. So, 
as our copies of individual course materials 
dwindled, we thought it only natural to bring 
them all together under one cover. This book 
is the result. 

We think the extraordinary time and ef- 
fort was well worth it. 
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PREFACE 


In this introductory Chapter we intend to: 

1). develop a common terminology for future chapters. 

2). Introduce several stored-program-computer design topics. 

3). define some of the computer architect's problems (which 
will be solved in the subsequent chapters). 


In order to achieve these goals, we will start with computer 
basics. it should be stressed that approaches and solutions 
can be chosen which are different from the ones described in 
this and the subsequent chapters. However, the general ideas 
described will be appropriate to gain familiarity with-the micro- 
programmable bit-slice devices in order to use them in any 
design configuration. 


BACK TO THE BASICS... 


A STORED-PROGRAM-COMPUTER Is defined as a machine 
. Capable of manipulating data according to predefined rules 
(instructions), where the program (collection of instructions) 
and data are stored in_its memory (Fig. 1). Without some 
means of communication with the extemal world, the program 
and the data cannot be loaded into the memory nor can the 
results be read out. Therefore, an input/output device is re- 
quired as shown in Fig. 2. 


Figure 2, YO Added to the Basic Stored-Program 
Computer. 
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The memory fs usually organized in words, each containing N 
bits of information. A unique address is allocated for each 
word which defines Its position relative to other words. The. 
Central Processor Unit (CPU) usually reads or writes one. 
word at a time by addressing the memory and then when the 
memory ts ready, reading the contents of the word or writing 
New contents into that word. To perform this operation, two 
registers are usually used: The Memory Address Register 
(MAR), which contains the address and the Memory Data 
Register (MDR) which contains the data (Fig. 3). 


Figure 3. MAR and MOR Depicted for a 
Stored-Program Computer. 


Since accessing a memory (reading from it or writing into it) Is 
usually a relatively slow procedure, it is advantageous to have. 
a few memory locations inside the CPU which can be read 
from or written into very fast. These locations are usually 
called Accumulators or Working Registers. Having these fast 
access registers inside the CPU (Fig. 4) enables many opera- 
tions to be carried out without referring to the memory 
(through the MAR and the MDR) and therefore these opera- 
tions are executed faster. 


The unit which actually performs the data manipulation Is 
called the Arithmetic & Logic Unit (ALU). It has two inputs for 
operands and one output for the result. It usually operates on 
all the bits of a word in parallel. The ALU can perform all or 


- part of the following operations: ° 


Arithmetic Logical 
Add OR" 
Complement AND 
Subtract XOR ° 
Increment NAND 
Decrement NOR 

XNOR 

Complement 


In some architectures, one of the operands must always be in 
a special register (accumulator) and the result of the ALU op- 
eration is always transferred to this register. In a more general 
CPU, any two of the internal registers can contain the 
operands and the result of the ALU operation can be trans- 
ferred to any one of them. 


Another very uselul feature of a CPU is the ability to shift the 
contents of a register or the output of the ALU one or more 
bits in either direction as shown in Fig. 5. 


Figure 5. ALU and Shifter Added to the CPU Design. 


We now have the elements to do any data manipulation re- 
Quired but we still need a unit which can properly set the MAR 
in order to find the next instruction of the program in the 
memory and to find its associated data. This unit is called the 
Program Control Unit (PCU) and its role Is to load the MAR 
with the correct addres in order to find the next instruction or 


data item or to point to a memory location where a data word 
should be written. 


Often, the program steps (instructions, data) are written In the 
memory in consecutive locations, starting at address zero or 
at any other predefined address. The PCU can simply be in- 
cremented after each memory access thereby pointing to the 
address of the next instruction or data item. This counter-type 
PCU has very little flexibility. Sometimes we wish to change 
the “normal” flow of the instructions, particularty if we want to 
enable our computer to “make decisions” according to condi- 
tions prevailing at the current execution point. For example, 
we may want lo execute one of two ditferent sequences ol in- 
Structons depending upon the result of the last operation per- 
formed. This is accomplished by loading the MAR with a new 
value (the address of the rext instruction to be executed) 
rather than incrementing it. This operation is called a 
BRANCH or JUMP and can be unconditional (which allows 
execution of a non-contiguous string of instructions) or condi- 
tional (depending, for example, on whether the last opera- 
tion’s result was zero or not, was negative or positive, true or 
false, etc.). 


Even more flexibility can be achieved by using a stack (a 
group of temporary intemal or external memory locations) to 
store vital data. A stack pointer is used to address the mem- 
ory location currently at the top of the stack. indirect and rela- 
tive addressing and other sophisticated addressing modes (alli 
of which can be handled by the PCU) will be discussed later. 
Meanwhile, Fig. 5 shows the PCU as a part of the CPU. 


Executing an instruction in our computer now requires the 
following steps: 


a). The PCU loads the address of the next instruction to the 
MAR and signals to the memory that a Read is re- 
quested. incidentally, the PCU may be as simple as a 
Program Counter equal to the address width. The mem- 
ory loads the MDR with the contents of the location ad- 
dressed. . 

b). The CPU decodes the instruction: i.e., (assuming 
operands are in internal registers) selects the proper reg- 
isters to feed the ALU, selects the proper function to be 
performed by the ALU, sets up the shifter to displace the 
result, if required, and selects the register in which the 
result should be stored. 

c). The ALU performs the function desired. 

d). The result is loaded into the destination register. 

e). The result is also examined to determine whether a 
BRANCH Is to be performed. 

f). The PCU caiculates the address of the next instr ction, 
(usually called a "FETCH"). 


This procedure becomes more complicated if the operands 
are not storéd in the"internal registers or if the result is not to 
be stored in one of them. Let's take an example instruction 
using relative addressing: 


“Take the first operand from the location specified by the 
sum of the word after this instruction (immediate) and the 
contents of register R1; take the second operand from 
the location specified by the sum of the second word 
‘after this instruction and the contents of R2; add the two 
operands and place the result in the location specified by 
the sum of the third word after this instruction and the 
contents of register R3. Then execute the instruction lo- 
cated at the address, which is the sum of the fourth word 
after this instruction and the contents of register R4 if 
there is a carry resulting from the addition. Otherwise 
continue sequentially”. 


The steps required to execute this instruction are as follows: 


a). The PCU loads the address of the next Instruction to the 
MAR, signalling to the memory that a Read is requested. 
The memory loads the MDR with the contents of the loca- 
tion addressed. 

b). The CPU decdédes the instruction, i.e., initiates the follow- 
ing steps. 

c). The PCU is incremented and the next word is read from 
the memory. s 

d). Register R1 and the MDR are selected as source regis- | 
ters, MAR Is the destination register. 

e). The ALU performs “ADD” and the result is placed in the 
MAR. 

f). The first operand is fetched from the memory and placed, 
for example, in RS. 

g). The PCU is incremented and the hext word is read from 
the memory. 

h). Register R2 and the MDR are selected again as aeuiee 
registers and MAR as the destination. 


i). The ALU performs "ADD" and the result is placed in 
MAR. 

]). The second operand is fetched trom the memory and is 
placed, for example, in R6. 


k). The PCU is incremented, the next word is read from the 


memory. 

f). Register R3 and the MOR are selected as source regis- 
ters, the MAR as destination. 

m). The ALU performs “ADD” and the result is placed in the 
MAR, which now points to the location where the sum of 
the operands should be stored. 

n). Registers RS and R6 are selected as sources (they con- 
tain the operands), MDR is now the destination. 

0.) The ALU performs "ADD" and the result is placed in 
MDR. 

p). A memory write cycle takes place and the contents of the 
MDR is stored at the desired address. 

q). The carry is examined to determine the next step to be 
performed. Assume there is no carry. 

r). The PCU is incremented twice (in order to skip the fifth 
word of the present instruction). it now points to the ad- 
dress of the next instruction. 


As can be seen, 18 steps were used to perform a single ad- 
dition using this complex relative addressing scheme. Obvi- 
ously, our CPU needs some kind of “coordinator” which can: 


1). Decode an instruction fetched from the memory. 

2). Initiate the proper cycle of steps to be performed. 

3). Set up the various controls for each step. 

4). Execute the steps in an orderly sequence. 

5). Make decisions according to the state of various signals 
(conditions). 


We will call this coordinator the Computer Controt Unit (CCU) 
and it is depicted ‘in Fig. 6. Our CPU is now complete (more 
of less) and we will go into more detail later. 


THE MEMORY 


Let's now discuss the memory. The information stored in the 
memory is organized in words, where each word consists of N 
bits. N may be as small as 8 for very simple processors or as 
large as 64 in more powerful machines. The most common 
memory width for minicomputers ts 16 bits. The number N is 
called the width of the memory and the number of bits in the 
MOR is obviously also N; equal to the width of the memory. 


The depth of a memory is the number of words it contains. 
With a MAR having k bits, 2" consecutive memory locations 
can be addressed. The addresses start from zero and range 
through 2"-1. 


The read access time of a memory directy accessible by the 
CPU Is the time needed from stable address at the memory 
until the data is property stored in the MDR. This access time 
depends on the type of memory used and can be as low as a 
few tens of nanoseconds and as large as several micro- 
seconds. Using high speed memory improves the pertor- 
mance of the computer as less time is wasted waiting for the 
memory to respond. in general, faster memories are costly, 
take more PC board area and use more power which results 
in more heat. A 32 bit wide, 2K (2048) word memory with 50 
NManosecond access time may need 10 amps from the +5V 
power supply and may require a board area of 10° x 6”. Yet 
this is a very small memory space. 


It is usually not justified to have very large high-speed 
memories. Not all the programs and associated data need to 
reside in this memory at once. We may have the current pro- 
gram (or only a part of it) in the memory while other programs 
or data files can reside elsewhere and be brought into mem- 
ory during the appropriate part of the program when needed. 
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Figure 6. A Computer Control Unit (CCU) Inctuded In a CPU. 


This “elsewhere” may be a magnetic tape, cassette, disk, 
diskette, etc. and we will call it Bulk Memory. The distinctive 
characteristics of Bulk Memory are: 

1). very large capacity 

2). non-volatile (retains the information when not in use) 

3). not randomly accessible 

4). leng access time 

5). inexpensive (per bit) 


Usually, Bu‘k Memory devices are serially accessible, .e., the 
access time for the first word is large, but then consecutive 
words can be accessed relatively fast. 


In a later chapter the most efficient process of communication 
between the main and the bulk memory, called the Direct 
Memory Access (DMA), will be discussed in detail. 


THE EXTERNAL WORLD 


in any useful machine, some means of communicating wilh 
the extemal word is needed. It may be a keyboard, a CRT, a 
Card reader, a paper tape punch or, in @ process controller, 
tusding sensors of positioning actuators. The common de- 
nominator of almost all of the input/output devices is that they 
ere much slower than the CPU and therefore a timing prob- 
hem anses; the CPU must know when the /O device is ready 
for data transter. Usually, a signal is sent by the device to the 
CPU in order to draw its attention. The CPU now can do one 
of two things: 


1). Test this signal periodically and when it is present, jump 
to & program which handles the data transfer. This type 
ot operation is called “Polling”. This technique has two 


PROCESSOR UNIT (APU) 


major drawbacks: First, appreciable computer time is 
Spent performing these periodic tests where most of them 


* will fail (no “Ready” signal present). Second, the recogni- 


2). 


tion by the computer CPU of the appearance of a signal 
ts delayed until the CPU arrives at this device in its pol- 
ling sequence. 


Imagine what will happen if there are a large number of 
VO devices. Long latency times (delays) will occur if 
many VO devices are busy simultaneously. 


Include some hardware in the CPU which can sense the 
Presence of a “Ready” signal and interrupt the normal 
flow of the instructions and force the computer to “Jump” 
to the VO service program whenever there is a request. It 
can even send the CPU to different programs according 
to the VO device whose “Ready” flag was detected and 
even establish priority among the different devices if more 
than one device would like to have the CPU's attention at 
the same time. Moreover, under program control, this cir- 
Cuitry can ignore some or ail of the signals if the com- 
puter CPL! must not be interrupted at that time. Obviously 
by paying the price of very little hardware, we gain 
enormously in computer performance. We will call this 
hardware the “interrupt Controller" and will discuss it 
thoroughly later. 


Our computer is now depicted in Fig. 7. We have Included the 
ALU, the internal register file and the shift circuit in one block. 


which we call the “Arithmetic Processor Unit.” 


In the following pages and In the subsequent chapters, we will 
deal in more detail with each area of the machine. 


Figure 7. The Stored-Program-Computer with DMA and interrupt Control Added. 


A WORD ABOUT THE INSTRUCTION SET 


The internal architecture of the CPU depends to some extent 
on the instruction set the computer is to execute. if the in- 
Struction set is large, some of the instructions usually are 
more complicated and the computer is more powerful, faster 
and more efficient. On the pther hand, the internal circuitry is 
also more complicated. Some examples of these tradeofts are 
as follows. 


’ ALU Processing Capability: 


Although with three basic functions (add, complement, and 


OR/AND) all the arithmetic and logic operations can be per- 


. formed, most processors are built to perform subtract, NAND, 


XOR, etc. This is perhaps the most outstanding example of 
how performance and speed can be gained with little penalty 
on the complexity of the machine. With the added features an 
XOR operation can be performed in one instruction instead 
of 5. 


Data Movement: 


Let us assume 4 different computers whose data movement 
Capabilities are described below: 


Machine A). A word can be read from the memory and 
loaded into Register A only. The contents of Register A can 
be written into the memory, of can be moved into any other 
register. The contents of any register can be copied into 
Register A. 


Machine 8). The contents of any register can be copied into 
any other register or it can be written into the memory. A word 
fead from the memory can be loaded Into any register. 


Machine C). As B above but With the added capability to read 
{rom one location in memory, to write that word into another 
location in memory. 


Machine D). As C above and also the memory-to-memory 
operation can be performed on consecutive addresses repeti- 
tively. The number of word transfers (or upper and lower ad- 
dress limits) are specified by the instruction. 


Machine A has very limited data movement capability. In 
order to perform an operation on two operands residing in the 
memory, we have to: 


1). Bring the first operand from the memory into Register A. 
2). Copy it into another register. 

3). Bring the second operand into Register A. 

4). Perform the operation required (result in A). - 

5). Store the contents of Register A into the memory. 


If consecutive operations are required with several partial re- 
sults, the drawbacks of machine A become more annoying, 
especially if tha number of internal registers is small. 


Moving a data block from one location in the memory to 
another location can be performed by one instruction in com- 
puter D, but requires the transfer of each word first to an in- 
ternal register then to the new memory location in machines 
A, 8 (two instructions for each word transferred). 


Obviously the decoding, multipiexing and sequencing of the 
computers grow in complexity as we proceed from machine A 
to machine D. We trade the compiexity of hardware versus 
the sottware (programming), speed and performance. 


Addressing: 
The operands for an operation can be found in several ways: 


@ The operand is an explicit part of the instruction (Im- 
mediate) 

@ The address of the operand is an explicit part of the in- 

_ Struction. (Direct) 

@ The address of the operand is in an internal register; the - 
register itself is specified by the instruction. (RR) 

@ The address of the operand Is the sum of the contents of 
an intemal register (specified by the instruction) and a 
number (called the displacement) which is an explicit part 
of the instruction. (RX) 

@ The contents of an internal register are added to a 
number found in an address specified by the instruction. 
The sum ts the address of the operand. (Indirect) 

e@ The contents of an internal register are added to a 
number which is an explicit part of the instruction. The 
sum points to the location where the address of the 
operand is written. (Indirect) 

@ The contents of an internal register are added to a 
number which can be found at the location explicitly 
specified by the instruction. The sum thus formed points to 
a location where the address of the operand is written. 

e@ Etc. 


Many other schemes can be formed by combining’ the above 
Operations of by chaining them. In every case an “Effective 
Address" must be found by calculations and/or memory refer- 
ences. Again, we can gain pertormance by using more 
sophisticated addressing schemes but we will pay for it by 
adding complexity to our machine, especially In its control por- 
tion. 


TIMING, SEQUENCING, CONTROLLING 


In the previous paragraphs we have shown that we can gain 
performance in our computer by having a more complicated 
instruction set but more complex hardware is required, usuaity 
in the CCU. We have also shown an example for an “Add” 
operation which required 18 precisely controlled steps. Even if 
we assume that some of them can be performed simultane- 
ously, we will need a multiphase clock to control these steps 
— something like that shown in Fig. 8. We can now load an 
instruction register at the beginning of an instruction with the 
first word of the instruction (the OP CODE) as is shown in 
Fig. 9. Using the outputs of the Instruction Register (iRg to 
IRpy.1), the different phases of the clock and the various condi- 
tion inputs to the CCU, we can now try to write the logical 
equations which should satisty all of the steps of all the in- 
structions of our instruction set. Then use Karnough maps or 
other techniques to reduce these equations and finally realize 
them using AND, OR, INVERT gates and Flip Flops. Simple, 
isn't it? Imagine the complexity of a scphisticated computer 
and the debugsing process it needs! 


The question posed immediately is: Isn't there a more or- 
ganized and more easily understandable way to do that? Or, 
perhaps, can we have some piocessor do the job for us? 
Can't we have some kind of “micro-machine” which can take 
care of all the timing, sequencing and controlling jobs of our 
computer — a computor inside the computer? With the advent 
of the Am2900 family — new Bipolar LS! devices — the an- 
swer is: Yes, we can! 
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Figure 8. An &Phase Clock. 


Figure 9. The Instruction Register Bits. 
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Figure 10. The Micromachine. 


THE MICRO-MACHINE 


What we need is essentially a machine which can execute a 
number of well defined sequences. But, remember that this is 
exactly the purpose of a stored program computer. The only 
difference between our micro-machine and a general purpose 
computer is that in the general purpose computer the program 
to be executed Is changed from task to task, while in our 
micro-machine it is fixed. This allows the use of PROM for its 
memory instead of the RAM needed in the general purpose 
(GP) computer. Our Computer Contro! Unit (CCU) using this 
micro-machine may now look like Figure 10. 


Basically, a microprogrammed machine is one in which a 
coherent sequence of microinstructions is used to execute 
various commands required by the machine. If the machine is 
a computer, each sequence of microinstructions can be made 
to execute a machine instruction. All of the little elemental 
tasks performed by the machine in executing the machine in- 
struction are called microinstructions. The storage area for 


these microinstructions is usually called the microprogram 
memory. 


A microinstruction usually has two primary parts. These are: 
(1) the definition and control of all elemental micro-operations 
to be carried out and (2) the definition and control of the ad- 
dress of the next microinstruction to be executed. 


The definition of the various micro-operations to be carried out 
usually Includes such things as ALU source operand selec- 


* tion, ALU function, ALU destination, carry control, shift control, 


interrupt contro!, data-in and data-out control, and so forth. 
The definition of the next microinstruction function usually in- 
cludes identifying the source selection of the next micro- 
instruction address and, in some cases, supplying the actual 
value of that microinstruction address. 


Microprogrammed machines are usually distinguished from 
non-microprogrammed machines in the following manner. 
Oider, non-microproorammed machines implemented the con- 
trol function by using combinations of gates and flip-flops 
connected In a somewhat random fashion in order to generate 
the required timing and control signals for the machine. Mi- 
cioprogrammed machines, on the other hand, are normally 


considered highty ordered and more organized with regard to 
the control function field. In its simplest definition, a micro- 
Program control unit consists of the microprogram memory 
and the structure required to determine the address of the 
next microinstruction. 


The OP-CODE (type of instruction to be executed by the 
computer) is loaded into the Instruction Register and the In- 
Struction Decoder decodes it. Actually, it generates the micro- 
address where the first step of the execution sequence for 
that instruction resides in the microprogram memory. The 
Am2910 sequencer then generates the microaddress of the 
next microinstruction. The microprogram data supplies the 
control signals we need to control all the parts of the com- 


puter (and there are a lot of them), including the sequencer 
itself. When all the steps of a machine instruction are exe- 
cuted, the microprogram will cause the reading (fetch) of the 
next machine instruction from the computer main memory. 
Typicalty, the Computer Control Unit is used to fetch instruc- 
tions and decode them using a PROM for mapping the op 
code to the initial address of the sequence of microinstruc- 
tions used to execute this particular instruction. It will also 
fetch all of the operands needed by the machine Instruction 
and deliver them to the ALU for processing. An example of 
the flow of a typical Computer Control Unit is shown in Figure 
11, 


Assume the OP-CODE of the machine instruction that we 
fetch is 8 bits wide. This allows us to execule a minimum of 
256 different instructions. Assume also that an average of 6 
steps are needed to execute these instructions. Even if sepa- 
rate microprogram memory locations are used, a depth of this 
microprogram memory is only 1-1/2K (K = 1024). But in that 
case, the sequencer can almost be replaced by a simple 
counter. Usually we would like to share some micro-routines 
among different instructions. With very litte effort, we can 
shrink the depth of the microprogram memory of Figure 10 to 
less than 1/2K. Of course the sequencer will be a littie more 
sophisticated; it will perform conditional Branch and micro- 


‘subroutine CALL's; but we still don't need the complicated 


addressing schemes for microprogram control as were de- 
scribed earlier as a part of the machine instruction set. 


On the other hand, the width of our microprogram memory 
may be large — maybe 60 to 100 bits. This will depend on the 
number of control lines needed in our computer. This is of no 
great disadvantage since the price of PROM devices is drop- 
ping constantly. In a future chapter we will discuss techniques 
to reduce the depth and width of the microprogram memory to 
save cost. 


It is important to understand the distinction between machine 
level instructions and microprogram instructions. Figure 12 
shows a typical machine instruction for a 16 bit minicomputer 
that has an &-bit opcode to identify one of 256 instructions; a 
4bit source register specification to identity one of 16 source 
registers and a 4-bit destination register specification to iden- 
tify one of 16 destination registers. The microprogram instruc- 
tion of Figure 12 may contain from 32 to 128 bits in a typical 
design; or even more bits in a vary fast, highly parallel mi- 
crocoded machine. This microinstruction word usually will con- 
tain fields for the ALU source operand, ALU function, ALU des- 
tination, status load enable, shitt multiplexer control, bus 


Amase3 
SOURCE 


32 TO 128 BITS. 


The machine instruction is 16-bits and consists of an op code, source register and destination register specifica- 
tion. The microprogram instruction defines all the elemental signals to control the various pieces of the machine. 


Figure 12. 
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cycle control, etc. These fields are used to control the vanous 
devices w.thin the machine so that its execution is as desired 
on each clock cycle. This is more straightforward than using 
combinatorial logic and yields a more organized design. 


Let us now compare the depth-over-width (d/‘w) ratio of the 
computer's main memory to that of our microprogram mem- 
ory. 


tn the Am9069A type microprocessor, the data field is 8 bits 
and the address field is 16 bits, allowing direct addressing of 
64K locations. The ratio d'w is 8K. In some minicomputers, 
the data width is 16-32 bits and the addressing capability Is 
64-128K. The d/w rabo is about the same. In larger computers 
with 32-64 bit data width, we find 256-512K deep memories or 
even deeper ones. The d/w ratio again is 8K at least. 


On the other hand, the d/w ratio in microprogram memories is 
seldom greater than a few tens. Even if we assume that it is 
2K deep and only 64 bits wide, we arrive at a dw ratio of only 
32; usually @ will be around 10. It is much easier to conto! a 
machine with a diw ratio of 10 to 20 than to control one with 
dw = 6K 


ONE MORE WORD 


We have suggested a replacement of the “random logic” 
realizaton of the CCU by a micro-machine. We call this a 
“Microprogrammed Architecture”. Perhaps the biggest advan- 
tage of this type of architecture is the ease of structuring the 
contro! sequence. We allocate a bit or a group of bits in the 
Mucroprogram memory to control a certain function (e.g.: ALU 
source register selection, ALU function, ALU destination selec- 
ton, conditon selection, next address calculation selection, 
MOR destnation selection, MAR source selection, etc., etc.) 
and for each microstep we write the appropriate state for 
these bits (LOW-HIGH) into this memory field. Later we wiil 
see that automated and sophisticated tools are avall- 
able to perform this microprogram writing. One such tool is 
AMDASM™ as available on System 29. But, this is not the only 
advantage of the microprogrammed architecture. 


As nobody is perfect, some “bugs” may inadvertently slip into 
the design. In a random logic architecture, we will have to re- 
design and usually rebuild the whole computer. On the other 
hand, in a microprogrammed machine it is usually sufficient to 
change a couple of bits in the microprogram to rectify the 
problem. This is even easier if a RAM instead of a PROM is 
used during the development and debugging phases. Of 
course, we must be able to load this memory with the micro- 
program by some extemal means. Again, a powerful tool is 
available: AMD's System/29™. : 


Finally, let's face the reality: The marketing guys usually 
change their requirements (i.e., the instruction set) when you 
are 80% through your logic design. Now you have to start 
over from scratch. Not sol Change some microcode, perhaps 
very litle hardware too and here you are! It is even more 
convenient when only additions to the existing instruction set 
are considered. Just add a few lines to your microprogram to 
comply with those new ideas! A mere few minutes using Sys- 
tem 29 — That's flexibility! Incidentally, don't tell the marketing 
guys how easy it is or you will NEVER get the product out!l 


SUMMARY 


The block diagram of Figure 13 shows a typical 16-bit 
minicomputer architecture. Also identified on this block dia- 
gram are various Am23900 family elements that might be used 
in each of these blocks. Such a design might use either 
4-Am2901A's or 4-Am2903's for the data path ALU. An 
Am2910 could be used as the microprogram sequencer for 
control of up to 4K words of microprogram memory. Also 
shown on the block diagram are the Am9130 and Am9140 
MOS Static RAM's which are potential candidates for use in 
the computer's main memory. 


The following chapters will discuss various blocks of Figure 13 
in detail and give design examples for each section. Needless 
to say, the design engineer can appropriately tailor any design 
to meet his throughput requirements. Also, special algorithms 
can be executed by adding the appropriate hardware and mi- 
crocode to the blocks described. 
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Figure 13. A Genoralizod Computer Architecture. 
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Chapter Il 
Microprogrammed Design 


CHAPTER II 
MICROPROGRAMMED DESIGN 


INTRODUCTION 


A microprogrammed machine is one In which a coherent se- 
quence of microinstructions is used to execute various com- 
mands required by the machine. if the maching is a computer, 
each sequence of microinstructions can be made to execute a 
machine instruction. All of the fitte elemental tasks performed 
by the machine in executing the machine instruction are called 
microinstructions. The storage area for these microinstructions 
is usually called the microprogram memory. This tochnique was 
identified by Wilkes in the 1950's as a structured upproach to 
the random control logic in a compute:. 


A microinstruction usually has two primary paris. These 
are: (1) the definition and control of all elemental micro- 
operations to be camed out ard (2) tha definition and control of 
the address of the next microinstruction to be executed. 


The definition of the various micro-operations to be cared out 
usually includes such things as ALU source operand selection, 
ALU function, ALU destinalion, cary control, shift control, inter- 
tupt control, data-in and data-out control and so forth. The def- 
inition of the next microinstruction function usually includes 
identitying the source selection of the next microinstruction ad- 
Gress, and in some ‘cases, supplying the actue! value of that 
microinstruction address. 


Microprogrammed machines are usually distinguished from 
non-m:croprogrammed machines in the fol’cowing manner. Old- 
ef, non-microprogrammned machines implemented the control 
function by using combinations of gates and flip-flops con- 
nected in a somewhat random fashion in order to generate the 
required timing and control signals for the machine. Micropro- 
grammed machines, on the other hand, are normally considered 
highly ordered and more, organized with regard to the control 
function field. In its simplest definition, a microprogram control 
unit consists of the microprogram memory and the structure 
required to determine the address of the next microinstructon. 


Microprogramming is normally selected by the design engineer 
@S a control technique for finite state machines because it im- 
proves flexibility, parlormence, and LSI utilization. Several addi- 
tional key features of microprogrammed designs are listed be- 
low: 


More structured organization 

Diagnostics can be implemented easily 

Design changes are simplia 

Field updates are easy 

Adaptatons are straightforward 

System definition can be expanded to include new features 
Documentation and Service are easier 

Design aids are available 

@ Cost and design time are reduced 


THE MICROPROGRAM MEMORY 


The microprogram memory is simply an N word by M bit mem- 
ory used to hold the various microinstructions. For an N word 
memory, the address locations are usually defined as location 
0 through N-1. For example, a 256-word microprogram mem- 
Ory will have address locations 0 through 255. Each word of 
the microprogram memory consists of M bits. These M bits are 
usually broken into various field definitions and the fields can 
consist of various numbers of bits. It is the definiton of the var- 
ious fields of a microprogram word that is usually referred to as 
FORMATTING 


An exampie of how mucroinstructon fields are defined in a typ- 
ical machine microprogram memory word is as follows: 


Field 1 — General purpose 

Fieid 2 — Branch address 

Field 3 — Next microinstruction address control 
Field 4 — Conditon code multiplexer contro! 
Field 5 — Interrupt contro} 

Field 6 — Fast cock/siow clock select 

Field 7 - Carty contro! 

Field 8 — ALU source operand control 

Field 9 - ALU function control 

Field 10 — ALU destination control 

Field 11 — Shifi multiplexer convol 

Fiaid 12 — etc. 


EXECUTING MICROINSTRUCTONS 


Once the microprogram format has been defined, it is neces- 
sary to execute sequences of these microinstructions if the 
machine is to perform any real function. In its simplest form, all 
that is required to sequence through a series of microinstruc- 
tions is a microprog:am address counter. The microprogram 
address counter simply increments by one on each clock cycle 
to select the address of the next microinstruction. For example, 
if the microprogram address ccunter contains address 23, the 
next clock cycle will increment the counter ard it will select ad- 
dress 24. The counter will continue to increment on each clock 
cycle thereby selecting address 25, address 26, address 27, 
and so forth. If this were the only control available, the machine 
would not be very flexible and it wou'd be able to execute only 
a fixed pattem of microinstructons. 


The technique of continuing from one microinstruction to the 
next sequential microinstruction is usually referred to as CON- 
TINUE. Thus, in microprogram contro! definition, we will use the 
CONTINUE (CONT) statement to mean simply incrementing to 
the next microinstructo1. 


MICROPROGRAN JUMPING 


if the microprogram contol unit is to have the ability to select 
other than the next microinstruction, the control unit must b3 
able to load a JUMP address. The load control of a counter 
can be a single bit feld within the microprogram word format. 
Let us call this one-bit field the microprogram address counter 
load enabie bit When this bit is at logic 0, a load will be inhib- 
ited and when this bit is a logic 1, a load will be enabdied. If the 
load is enabled, the JUIAP address contained within the micro- 
program memory will be parallel loaded into the microprogram 
address countor. This results in the ability to perform an N-way 
branch. For example, if the branch address field is eight bits wide, 
a JUMP to any address in the memory space trom word 0 through 
word 255 can be performed. 


This simple branching control feature allows a microprogram 
memory controller to execute sequential microinstructions or 
perform a JUMP (JMP) to any address either before or after 
the address currently contained in the microprogram address 
counter. 


CONDITIONAL JUMPING 


While the JUMP instruction has added some flexibility to the 
sequencing of microprogram instructions, the controller still 
lacks any decision-making capability. This decision-making 
capability is provided by the CONDITIONAL JUMP (COND 
JMP) instruction. Figure 1 shows a functional block diagram of 
a microprogram memory/address contsoller providing the capa- 
bility to jump on either of two different conditons. In this exan- 
ple, the load select control is a two-bit field used to contro! a 
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Figure 1. A Two-Bit Controi Field Can be Used 
to Select CONTINUE, BRANCH, or 
CONDITIONAL BRANCH. 


four-input muttiplexer. When the two-bit field Is equivalent to bi- 
Nary zero, the multiplexer selects the zero input which forces 
the load control inactive. Thus, the CONTINUE microprogram 
control instruction is executed. When the two-bit load select 
field contains binary one, the D, input of the multipiexer is 
selected. Now, the load control is a function of the Condition 1 
input. If Condition 1 is logic 0, the microprogram address 
counter increments and It Condition 1 is logic 1, the jump ad- 
dress will be paralte! toaded in the next clock cycle. This opera- 
tion is defined as a CONDITIONAL JUMP. If the load select 
input contains binary 2, the D2 input is selected and the same 
conditiona! function is performed with respect to the Condition 2 
input If the load select field contains binary 3, the D3 input of 
the multiptexer is selected. Since the D3 input is tied to logic 
HIGH, this forces the microprogram address counter to the load 
mode indepencent of anything else. Thus, the jump address is 
loaded into the microprogram address counter on the next 
Clock cycle and an UNCONDITIONAL JUMP is executed. This 
load se‘ect Control function definition is shown in Table 1. 


TABLE 1. 
LOAD SELECT CONTROL FUNCTION. 


Continue 


Jump Condition 1 True 
dump Condition 2 True 
1 Suursg2 Unconditional 


OVERLAPPING THE MICROPROGRAM 
INSTRUCTION FETCH 


Now that a few basic microprogram address contro! instructions 
have been defined, tet us examine the control instructions used 
in a microprogram control unit featuring the overlap fetching of 
the next microinstruction. This technique is also known as 
“pipelining”. The block diagram for such a microprogram con- 
trot unit is shown in Figure 2. The key difference when con- 
pared with previous microprogrammed architectures is the exis- 
tence of the “pipeline register” at the output of the microprogram 
memory. By definition, the pipeline register (or microword 
register) contains the microinstruction currently being executed 
by the machine. Simultaneously, while this microinstruction is 
being executed, the address of the next microinstruction is 
applied to the microprogram memory and the contents of that 
Memory word are being fetched and set-up at the inputs to the 
Pipeline register. This technique of pipelining can be used to 
improve the performance of the microprogram control unit. This 
results because the contents of the microprogram memory 
word required for the next cycle are being fetched on an over- 
lapping basis with the actual execution of the current muicro- 
program word. It should be realized that when the pipeline ap- 
proach is used, the design engineer must be aware of the lact 
that some registers contain the results of the previous mi- 
croinstruction executed, some registers contain the current mr 
croinsiructon being executed, and some registers contain data 
for the next microinstruction to be executed. 
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Figure 2 Overlapping (or Pipelining) the Fetch of the 
Next Microinstruction. 


Let us now compare the block diagram of Figure 2 with that 
shown in Figure 1. The major difference, of course, is the addi- 
ton of the pipeline register at the output of the microprogram 
control memory. Also, notice the addition of the address multi- 
plexer at the source of the microprogram memory address. 
This address multiplexer is used to select the microprogram 
counter register or the pipeline register as the source of the 
next address for the microprogram memory. The condition code 
multplexer is used to control the address multiplexer in this ad- 
dress selection. By placing an incrementer at the output of the 
address multiplexer, is is possible to always generate the cur- 
rent microprogram address “plus one" at the input of the micro- 
Program counter register. 


in Figure 1, the microprogram address counter was described 
&s a counter and could be a device such as the Am25LS161 
counter. In the _impiementation as shown in Figure 2, the 
Am25LS161 counter is not appropnate. Instead, an incrementer 
and register are useti to give the equivalent effect of a counter. 


The key difference between using a true binary counter and the 
incrementer register described here is as follows. When the 
jump address trom the pipeline register is selected by the mul- 
tplexer, the incrementer will combinatorially prepare that ad- 
Gress plus one tor entry into the microprogram counter register. 
This entry will occur on the LOW-to-HIGH transition of the 
Clock. Thus, the microprogram counter register can always be 
Made to contain address plus one, independent of the seleciion 
of the next microinstruction address. When the address multi- 
Plexer is switched so that the microprogram counter register is 
selected as the source of the microprogram memory address, 
the incrementer will again set-up address plus one for entry into 
the microprogram counter register. Thus, when the address 
multiplexer selects the microprogram counter register, the ad- 
dress multiplexer, incrementer and microprogram counter fea: 
ter appear to operate as a normal binary counter, 


The condition code multiplexer SS, operates in exactly the 
same fashion as described for the condition code multiplexer of 
Figure 1. That is, binary zero in the pipeline register (the cur- 
rent microinstruction being executed) forces an unconditional 
selection of tha microprogram register via Do. Binary one or bi 
Nary two in the next address select control bits of the pipeline 
register cause a conditional selection at the address multiplexer 
via D, or D2. Thus, a CONDITIONAL JUMP can be executed. 
* Binary three in the next address select portion of the pipeline 
register causes an UNCONDITIONAL JUMP instruction to be 
executed via D3. 


When the overall machine timing is studied, it will be observed 
that the key difference between overlap fetching and non- 
overlap fetching involves the propagation delay of the micro- 
Program memory. In the non-pipelined architecture, the micro- 
program memory propagation delay must be added to the 
Propagation delay of all the other elements of the machine. In 
the overlap fetch architecture, the propagation delay associated 
with the next microprogram memory address fetch is a sepa- 
rate loop independent of the other portion of the machine. 


SUBROUTINING IN MICROPROGRAMMING CONTROL 


Thus far, we have examined the CONTINUE instruction as weil 
as the CONDITIONAL and UNCONDITIONAL JUMP instruc- 
tions for overlap fetch. Just as in the programming of minicom- 
Puters and microcomputers, the advantages of SUBROUTIN- 
ING can be realized in microprogramming. The idea here, of 
Course, is that the same block of microcode (or even a single 
microinstruction) can be shared by several microinstruction 
Sequences. This results in an overall reduction in the total 


number of microprogram memory words required by the de- 
sign. If we are to jump to a subroutine, what is required is the 
ability to store an address to which the subroutine should retum 
when it has completed its execution. Examining the block dia- 
gram of Figure 3, we see the addition of a subroutine and loop 
(push/pop) stack (also called the fila) and its associated stack 
pointer. The contro! signals required by the stack are an enable 
stack signal (FILE ENABLE = FE) which will be used to tell the 
file whenever we wish to perform a push or a pop, and a 
push/pop control (PUP) used to control the direction of the 
stack pointer (push or pop). 


In this architecture, the stack pointer always points to the address 
of the fast microinstruction written on the stack. This al- 
lows the “next address multiplexer” to read the stack at any 
time via port F. When this selection is performed, the last word 
written on the stack will be the word applied to the T cropro- 
gram memory. The condition code multpiexer of the 2revious 
example has also been replaced by a next address or ‘Tol unit. 
This next address control unit (Am29811A) can execute 16 dif- 
ferent next address contro! functons where most of these func- 
tions are conditional. Thus, the device has four instruction in- 


puts as well as one conditon code test input which is con-- 


nected to the condition code multiplexer. Note also that the 
next address control field of the microprogram word has been 
expanded to a four-bit field. Outputs from the Am29811A next 
address control block are used to control the stack pointer and 
the next address multiplexer of the Am2911. Jn addibon, the 
device has outputs to control the three-state enable of the 
pipeline register and the three-state enable of the starting ad- 
dress decode PROM. Also, the archilecture has a counter it 
can be used as a loop-counter or event counter. 


The 16 instructions associated with the Am29811A are listed i in 


Table 2. As is easily seen by referring to Table 2, three of the . 
instructions in this set are associated with subroutining in mi- ~ 
croprogram memory. The first instruction of this set, is a simple” 


conditional JUMP-TO-SUBROUTINE where the source of the 
subroutine address is in the pipeline register. The RETURN- 
FROM-SUBROUTINE instruction is also conditional and is uSuu 


to retum to the next microinstruction following the JUMP-TO-_ 


SUBROUTINE instruction. There is also a conditional JUMP- 
TO-ONE-OF-TWO-SUBROUTINES, where the subroutine ad- 
dress is either in the PIPELINE register or in the internal REG- 
ISTER in the Am2911. This instruction will be explained in 
more detail later. 


**YPICAL COMPUTER CONTROL UNIT 
a*" CHITECTURE USING THE 
Am2911 AND Am29811A 


The microprogram memory contro! unit block diagram of Figure 
3 is easity implemented using the Am2911 and Am29811A. 
This architecture provides a structured state machine design 
capable of executing many highly sophisticated next address 
control instructions. The Am2911 contains a next address mul- 
tiplexer that pravides four ditferent inputs from which the ad- 
dress of the next microinstruction can be selected. These are 
the direct input (D), the register input (R), the program counter 
(PC), and the file (F). The starting address decoder (mapping 
PROM) output and the pipeline register output are connected 
together at the D input to the Am2911 and ara operated in the 
three-state mode. 


The architecture of Figure 3. shows an instruction register ca- 
pable of being loaded with a machine instruction word irom the 
data bus. The op code portion of the instruction is decoded 
using a mapping PROM to arrive at a starting address for the 
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TABLE 3. PIN FUNCTIONS. 


a 


Direct Input Bit i 
(natruction Bit i 
Condition Cada 
Condition Code Enable 


Direct input to register/counter and multiplexer. Og 1s LSB 
Selects one-of-sixteen instructions for the AM2910 

Used an test criterion. Pass test is a LOW on CC. 

Whenever the signal is HIGH, CC is ignored and the part operates , 


as though CC were true (LOW). 


Carry-In 
Register Load 


Low order carry input to incramenter for microprogram counter 
When LOW forces loading of register/counter regardless of 


instruction of condition 


Output Enable 


Clock Pulse 

+5 Volts 

Ground 

Microprogramn Address Bit | 


Three-state contro! of Y; ourputs 
Triggers all internal state changes at LOW-to-H!GH edge 


Address to microprogram memory. Yq is LSB. Y44 is MSc. 


Full Indicates that five iterns are on the stack 


Pipeline Address Enable 


Can select #1 source (usually Pipeline Register) at direct 


input source 


Map Address Enable 


Can select #2 source (usually Mapping PROM or PLA) as 


direct input source 


Vector Address Enabie 


Can select #3 source (for example, interrupt Starting Address) 


as direct inpul source 


microinstruction sequence required to execute the machine in- 
struction. When the microprogram memory address is to be the 
first microinstruction of the machine instruction sequence, the 
Am29811A next address contro! unit selects the multiplexer D 
input and enables the three-state output from the mapping 
PROM. When the current microinstruction being executed is 
selecting the next microinstruction address as a JUMP function, 
the JUMP address will be available at the multiplexer DO input. 
This is accomplished by having the Am29811A select the next 
address multiplexer D input and also enabling the three-state 
output of the pipeline register branch address field. The register 
enable input to the Am2911 is connected {o ground so that this 
register will always load the value at the Am2911 D input. The 
value at D is clocked into the Am2911’s register (A) at the end 
of the current microcycle, which makes the D value of this mi- 
crocycle available as the R value of the next microcycie. Thus, 
by using the branch address field of two sequential micro- 
instructions, a conditional JUMP-TO-ONE-OF-TWO- 
SUBROUTINES or a conditional JUMP-TO-ONE-OF-TWO- 
BRANCH-ADDRESSES can be executed by either selecting 
the D input or the R input of the next address multiplexer. 


When sequencing through continuous microinstructions in mi- 
croprogram memory, the program counter in the Am2911 is 
used. Here, the Am29811A simply selects the PC input of the 
‘next address multiplexer. In addition, most of these instructions 
enable the three-state outputs of the pipeline register as- 
sociated with the branch address field, which allows the register 
within the Am2911 to be loaded. . 


The 4 x 4 stack in the Am2911 Is used for looping and sub- 
routining in microprogram operations. Up to four levels of sub- 
routines or loops can be nested. Also, loops and subroutines 
can be intermixed as long as the four-word depth of the stack 
is not exceeded, ; 


ARCHITECTURE OF THE Am2910 


The Am2910 is a bipolar microprogram controller intended for 
use in high-speed microprocessor applications. It allows ad- 
dressing of up to 4K words of microprogram. A block diagram 
is shown in Figure 4, 


The controller contains a four-input multiplexer that is used to 
select either the register/counter, direct input, microprogram 
counter, or stack as the source of the next microinstructon ad- 
dress. 


The register/counter consists of 12 D-type, edge-triggered flip- 
flops, with a common clock enable. When its load control, RLD, 
is LOW, new data is loaded on a positive clock transition. A 
few instructions include load; in most systems, these instruc- 
tions will be sufficient, simplifying the microcode. The output of 
the register/counter is available to the multplexer as a source 
for the next microinstruction address. The direct input fumishes 
a source of data for loading the register/counter. 
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Figure 4. Am2910 Block Diagram. 
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The Am2910 contains a microprogram counter (uPC) that is 
composed of a 12-bit incrementer foowed by a 12-bit register. 
The »PC can be used in either of two ways. When tha carry-in 
to the wncrementer is HIGH, the microprogram register is loaded 
on the next clock cycle with the current Y output word plus cne 
(¥+1pPC) Sequential microinstructions are thus executcd. 
When the carty-in is LOW, the incrementer passes the Y output 
word unmodifed so that PC is reloaded with the same Y word 
on the next clock cycle (Y PC). The same microinstruction 
ts thus executed any number of times. 


The third Source for the muliipteaer is the direct (D) inputs. This 
Source is used for branching. 


The fourth source avaitab!e al the multiplexer input is a 5-word 
by 12-bit stack (fe). The stack is used to provide return ad- 
Gress linkage whe executing microsubroutines or loops. The 
Stack contains a buiid-in stack pointer (SP) which always points 
fo the last file word wnitten. This altows stack reference opera- 
bons (looping) to be performed without a pop. The stack pointer 
Operates as an up‘down counter. Dunng microinstructons 2, 4 
and 5, the PUSH oferaton is performed. This causes the stack 
pointer to incrementand the file to be written with the required 
return hnkage. On the cycle following the PUSH, the retum 
data ts at the new locaton pointed to by the stack pointer. 


During six other microinstructions, a POP operation occurs. 
Jhhis places the information at the top of the stack onto the Y 
Sutputs. The stack pointer decrements at the next nsing clock 
edge following a POP, elfoctively removing old information trom 
tne top of the stack 


The stack pointer linkage is such that any sequence of pushes, 
Pops or stack references can be achieved. At RESET {instruc- 
tion 0). the’ depth of nestng becomes zero. For each PUSH, 
the nesting depth increases by one; for each POP, the depth 
Gecreases by one. The depth can grow to five. After a depth of 
five ts reached, FULL goes LOW. Any further PUSHes onto a 
full stack overwrites information at the top of the stack, but 
leaves the stack pointer unchanged. This operation will usually 
Gestroy "useful information and is normally avoided. A POP trom 
an empty stack places non-meaningtul dala on the Y outputs, 
but is otherwise safe. The stack pointer remains at zero 
whenever a POP is attempted from a stack already empty. 


The registes/counter is operated during three anicroinstructions. 
(8, 9, 15) as a 12-bit down counter, with result = zero Byailabie 
as a microinstruction branch test critenon. This provides effi- 
Gent iteration of microinstructions. The fegister/counter is ar- 
ftanged such that if it is preloaded with a number N and then 
used as a loop termination counter, the sequence will be exe- 
cuted exactly N+1 times. During instruction 15, a three-way 
branch under combined contro! of the loop counter and the 
condition code is available. ’ 


The device provides three-state Y outputs. These can be par- 
ticularly useful in designs requiring automatic checkout of the 
processor. The microprogram controlier cutputs can be forced 
into the high-impedance state, and pre-programmed sequences 
of microinstructions can be executed via extemal access to the 
address lines. . 


OPERATION 


Tabie 4 shows the result of each instruction In controlling the 
multiplexer which determines the Y oufputs, and in controlling the 
three enable signals PL, MAP and VECT. The effect on thepPC, 
the register/counter, and the stack after the next positive-going 
Clock edge is also shown. The multiplexer determines which 
internal source drives the Y outputs. The value loaded Into wPC is 
either identical to the Y output, or else one greater, as determined 
by Cl. For each instruction, one and only one of the three outputs 
PL, MAP and VECT is LOW. If these outputs control three-state 
enables for the primary source of microprogram jumps (usualy 
part ol a pipeline register), a PROM which maps the instructon to 
a microinstruction starting location, and an optional third source. 
(often a vector from a DMA or interrupt source), respectively, the 
three-state sources can drive the D inputs without further logic. 


Several inputs, as shown in Table 4 can modify instruction execu- 
tion. The combination CT HIGH and CCEN LOWis used as atest 
in 10 of the 16 instructions. ALD, when LOW, causes the D input 
to be loaded into the register/counter, overriding any HOLD or 
DEC operation specified in the instruction. OE, normally LOW. 
may be forced HIGH to remove the Am2910 Y outputs ffom a 
three-state bus, 


TABLE 4. Am2910 MICROINSTRUCTION SET. 


CONDO JUMP PL 
PUSH/COND LO CATA 


CONO RIN 


COND JUMP PL & POP 
LD CNTR & CONTINUE 


TEST ENO LOOP 
CONTINUE 


THREE-WAY BRANCH 


PASS 


FAIL 
CEEN = Low and GC = HIGH | TCEN = HIGH of CO = LOW 


HOLO PL 


Note: 1f CCEN = LOW snd CC = HIGH, hold: else load. X = Don’t Care. 


The stack, a five-word last-in, first-out 12-bit memory, has a 
pointer which addresses the value presently on the top of the 
stack. Explicit control of the stack pointer occurs during instruc- 
tion 0 (RESET), which makes the stack empty by resetting the SP 
to zero. Atter a RESET, and whenever else the stack is empty, the 
content of the top of stack is undefined until a PUSH occurs. Any 
POPs performed while the stack is empty put undefined data on 
the F outputs and leave the stack pointer at zero. Any time the 
Stack is full (five more PUSHes than POPs have occurred since 
the stack was last empty), the FULL waming output occurs. No 
additional! PUSH should be attempted onto a full stack; if tried, 
information at the top of the stack will be overwritten and lost. 


THE Am2910 INSTRUCTION SET 


The Am2910 provides 16 instructions which select the address of 
the next microinstruction to be executed. Four of the instructions 
are unconditional — their effect depends only on the instruction. 
Ten of the instructions have an effect which is partially controlled 
by an extemal, data-dependent condition. Three of the instruc- 
tions have an effect which is partially controlled by the contents of 
the internal register/counter. The instruction set is shown in Table 
4. In this discussion it is assumed that Cl is tied HIGH. 


In the ten conditional instructions, the result of the data- 
dependenttestis applied to CC. Ifthe rete) input is LOW, the testis 
considered to have been passed, and the action specified in the 
Name occurs; otherwise. the test has failed and an alternate 


(often simply the execution of the next sequential microinstruc- 


tion) occurs. Testing of CC may be disabled for a specific micro- 
instruction by setting CCEN HIGH, which unconditionally forces 
the action specified in the name: that is, it forces a pass. Other 
ways of using CCEN Include (1) tying it HIGH, which is useful ifno 
microinstruction is data-dependent; (2) tying it LOW if data- 
dependent instructions are never forced unconditionally; or (3) 
tying it to the source of Am2910 instruction bit Ig, which leaves 
instructions 4, 6 and 10 as data-dependent but makes others 
unconditional. All of these tricks save one bit of microcode width. 


The effect of three instructions depends on the contents of the 
register/counter. Unless the counter holds a value of zero, it is 
decremented; if it does hold zero, itis held and a different micro- 
Program next address is selected. These instructions are useful 
for executing a microinstruction loop a known number of times. 
Instruction 15 is affected both by the external condition code and 
the internal register/counter. 

Perhaps the best technique for understanding the Am2910 is to 


simply take each instruction and review its operation. In order to 
provide some feel for the actual execution of these instructions, 


Figure 5 is included and depicts examples of all 16 instructions. _ 


The examples given in Figure 5 should be interpreted in the 
following manner: The intent is to show microprogram flow as 
various microprogram memory words are executed. For exam- 
ple, the CONTINUE instruction, instruction number 14, as shown 
in Figure 5, simply means that the contents of microprogram 
memory word 50 is executed, then the contents of word 51 is 
executed. This is followed by the contents of microprogram 
memory word 52 and the contents of microprogram memory word 
53. The microprogram addresses Used in the examples were 
arbitrarity chosen and have na meaning other than to show in- 
struction flow. The exception to this is the first example, JUMP 
ZERO, which forces the microprogram location counter to ad- 
dress ZERO. Each dot refers to the time that the contents of the 
microprogram memory word is in the pipeline register. While no 
special symbology is used for the conditional Instructions, the text 


to follow will explain what the coe choices are in each 
example. 


it might be appropriate at this time to mention that AMD ftlas a 
microprogram assembler called AMDASM, which has the capa- 
bility of using the Am2910 instructions in symbolic representa- 
tion. AMDASM's Am2910 instruction symbolics (or mnemonics) 
are given in Figure 5 for each instruction and are also shown in 
Table 4. 


Instruction 0, JZ (JUMP and ZERO, or RESET) ynconditionally 
specifies that the address of the next microinstruction is zero. 
Many designs use this feature for power-up sequences and pro- 
vide the power-up firmware beginning at microprogram memory 
word location 0. 


Instruction 1 is a CONDITIONAL JUMP-TO-SUBROUTINE via 
the address provided in the pipeline register. As shown in Figure 
§, the machine might have executed words at address 50, 51 and 
52. When the contents of address 52 is in the pipeline register, the 
next address control function is the CONDITIONAL JUMP-TO- 
SUBROUTINE. Here, if the test Is passed, the next instruction 
executed will be the contem of microprogram memory location 
90. if the test failed, the JUMP-TO-SUBROUTINE will not be 
executed; the contents of microprogram memory location 53 wilt 
be executed instead. Thus, the CONDITIONAL JUMP-TO- 
SUBROUTINE instruction at location 52 will cause the instruction 
either in location 90 or in focation 53 to be executed next If the 
TEST input is such that location 90 is selected, value 53 will be 


pushed onto the intemal stack. This provides the return linkage 


for the machine when the subroutine beginning at location 90 is 
completed. in this example, the subroutine was completed at 
location 93 and a RETURN-FROM-SUBROUTINE would be 
found at location 93. 


Instruction 2 is the JUMP MAP instruction. This is an uncondi- 
tional instruction which causes the MAP output to be enabled so 
that the next microinstruction Ie~ation is determined by the ad- 
dress supplied via the mapping PROMs. Normally the JUMP 
MAP instruction is used at the end of the instruction fetch se- 
quence for the machine. In the example of Figure 5, microinstruc- 
tions at locations 50, 51, 52 and 53 might have been the fetch 
sequence and at its completion at location 53, the jump map 
function would be contained in the pipeline register. This example 
shows the mapping PROM outputs to be 90; therefore, an uncon- 
ditional jump to microprogram memory address 90 is performed. 


Instruction 3, CONOITIONAL JUMP PIPELINE, derives its 
branch address from the pipeline register branch address value 
(BRg-BR,, in Figure 6). This instruction provides a technique for 
branching to various microprogram sequences depending upon 
the test condition inputs. Quite often, state machines are de- 
signed which simply execute tests on various inputs waiting for 
the condition to come true. When the true condition is reached, 
the machine then branches and executes a set of microinstruc- 
tions to perform some function. This usually has the effect of 
resetting the input being tested until some point in the future. 
Figure 5 shows the conditional jump via the pipeline register 
address at location 52. When the contents of microprogram 
memory word 52 are in the pipeline register, the next address will 
be either location 53 or location 30 in this example. If the test is 
passed, the value currently in the pipeline register (3) will be 
selected. If the test fails, the next address selected will be con- 
tained in the microprogram counter which, in this example, is 53. 


Instruction 4 is the PUSH/CONDITIONAL LOAD COUNTER in- 
struction and Is used primarily for setting up loops in micropro- 
gram firmware. In Figure 5, when instruction 2 is in the pipeline 
register, a PUSH will be made onto the stack and the counter will 
be loaded based on the condition. When a PUSH occurs, the 
value pushed is atways the next sequential instruction address. In 
this case, the address is 53. If the test fails, the counter is not 
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0 JUMP ZERO (JZ) 
\) 
1 
2 


3 COND JUMP PL (CP) 


NK 


6 COND JUMP VECTOR (CIV) 


8 REPEAT LOOP, CNTR # 0 (RFCT) 


STACK 
@usH) 


REGISTER) 
COUNTER 


exer ese8 


11 COND JUMP PL & POP (CPP) 


14 CONTINUE (CONT) , 


1 COND SSB PL (CJS) 


4 PUSH/COND LD CNTR (PUSH) 


STACK 
REGISTER’ 
COUNTER 


7 COND JUMP R/PL (JRP) 


sO 

st 

s2 

52 
70 Ly 
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9 REPEAT PL, CNTR ¥ 0 (RPCT) 


COUNTER 
(voc) 


12 LOCNTR & CONTINUE (LDCT) 


15 THREE-WAY er (Twa) 


STACK 
(PUSH) 


REGISTER/ 
COUNTER 


ed 


Figure 5. Am2910 Execution Examples. 


2 JUMP MAP (JMAP) 


5 COND JSB R/PL (JSRP) 


10 COND RETURN (CRTN) 


13 TEST END LOOP (LOOP) 


loaded; if it is passed, the counter is loaded with the value con- 
tained in the pipeline register branch address field. Thus, a single 
microinstruction can be used to set up a loop to be executed a 
specific number of times. Instruction 8 will describe how to use 
the pushed value and the register/counter for looping. 


Instruction 5 is a CONDITIONAL JUMP-TO-SUBROUTINE via 
the register/counter or the contents of the PIPELINE register. As 
shown in Figure 5, a PUSH is always performed and one of two 
subroutines executed. In this example, either the subroutine be- 
ginning at address 80 or the subroutine beginning at address 90 
will be performed. A retum-from-subroutine (instruction number 
10) retums the microprogram flow to address 55. In order for this 
microinstruction control sequence to operate correctly, both the 
next address fields of instruction 53 and the next address fields of 
instruction 54 would have to contain the proper value. Let's 
assume that the branch address fields of instruction 53 contain 
the value 90 so that it will be in the Am2910 register/counter wher. 
the contents of address 54 are in the pipeline register. This 
requires that instruction at address 53 load the register/counter. 
Now, during the execution of instruction 5 (at address 54), if the 
test failed, the contents of the register (value = 90) will select the 
address of the next microinstruction If the test input passes, the 
Pipeline register contents (value = 80) will determine the address 


of the next microinstruction. Therefore, this instruction provides _ 


the ability to select one of two subroutines to be execuede based 
on a test condition. 


Instruction 6 is a CONDITIONAL JUMP VECTOR instruction 
which provides the capabllity to take the branch address from a 
third source heretofore not discussed. tn order for this instruction 
to be useful, the Am2910 output, VECT, is used to control a 
three-state control input of a register, buffer, or PROM containing 
the next microprogram address. This instruction provides one 
technique for performing interrupt type branching at the micro- 
program level. Since this instruction is conditional, a pass causes 
the next address to be taken from the vector source, while failure 
causes the next address to be taken from the microprogram 
counter. In the example of Figure 5, if the CONDITIONAL JUMP 
VECTOR instruction is contained at location 52, execution will 
continue at vector address 20 if the TEST input is HIGH and the 
microinstruction at address 53 will be executed if the TEST. input 
Is LOW. 


Instruction 7 is a CONDITIONAL JUMP via the contents of the 
Am2910 REGISTER/COUNTER or the contents of the PIPELINE 
register. This instruction is very similar to instruction 5; the condi- 
tional jump-to-subroutine via R or PL. The major difference be- 
tween Instruction 5 and instruction 7 is that no push onto the stack 
ts performed with 7. Figure 5 depicts this instruction as a branch 
to one of two locations depending on the test condition. The 
example assumes the pipeline register contains the value 70 
when the contents of address 52 is being executed. As the 
contents of address 53 is clocked into the pipeline register, the 
value 70 is loaded into the register/counter in the Am2910. The 
value 80 is available when the contents of address 53 is in the 
pipeline register. Thus, control is transferred to either address 70 
or address 80 depending on the test condition. _ 


Instruction 8 is the REPEAT LOOP, COUNTER # ZERO instruc- 
tion. This microinstruction makes use of the decrementing capa- 
bility of the register/counter. To be useful, some previous instruc- 
tion, such as 4, must have loaded a count value into the register/ 
counter. This Instruction checks to see whether the register/ 
counter contains a non-zero value. If so, the register/counter is 
decremented, and the address of the next microinstruction is 
taken from the top of the stack. If the register counter contains 
Zero, the loop exit condition is occuring; control falls through to 


the next sequential microinstruction by selecting pPC: the stack 


is POP'd by decrementing the stack pointer, but the contents of y 


the top of the stack are thrown away. 


An example of the REPEAT LOOP, COUNTER # ZERO instruc- 
tion is shown in Figure 5. In this example, location 50 most likely 
would contain a PUSH/CONDITIONAL LOAD COUNTER?’ in- 
struction which would have caused address 51 to be PUSHed on 
the stack and the counter to be loaded with the proper value for 
looping the desired number of times. 


In this example, since the foop test is made at the end of the 
instructions to be repeated (microaddress 54), the proper value to 
be loaded by the instruction at address 50 is one less than the 
desired number of passes through the loop. This method allows a 
loop to be executed from 0 to 4095 times. 


Single-microinstruction loops provide a highly efficient capability 


for executing a specific microinstruction a fixed number of times. - 


Examples include fixed rotates, byte swap, fixed point multiply, 
and fixed point divide. 


Instruction 9 is the REPEAT PIPELINE REGISTER, COUNTER 
# ZERO instruction. This instruction is similar to instruction 8 
except that the branch address now comes from the pipeline 
register rather than the file. In some cases, this Instruction may be 
thought of as a one-word file extension; that is, by using this 
instruction, a loop with the counter can still be performed when 
subroutines are nested five deep. This instruction's operation is 
very similar to that of instruction 8. The differences are that on this 


instruction, a failed test condition causes the source of the next © 


microinstruction address to be the D inputs; and, when the test 
condition is passed, this instruction does not perform a POP 
because the stack is not being used. 


In the example of Figure 5, the REPEAT PIPELINE, COUNTER 
# ZERO instruction is instruction 52 and is shown as a single 
microinstruction loop. The address in the pipeline register would 
be 52. Instruction 51 in this example could be the LOAD 
COUNTER AND CONTINUE instruction (number 12). While the 
example shows a single microinstruction loop, by simply chang- 
ing the address in a pipeline register, multi-instruction loops can 
be performed in this manner for a fixed number of times as 
determined by the counter. 


Instruction 10 is the conditional RETURN-FROM-SUBROUTINE 
instruction. As the name implies, this instruction is used to branch 
from the subroutine back to the next microinstruction address 
following the subroutine call. Since this instruction is conditional, 
the retum Is performed only if the test is passed. If the test is 
failed, the next sequentia! microinstruction is performed. The 
example in Figure 5 depicts the use of the conditional RETURN- 
FROM-SUBROUTINE instruction in both the conditional and the 
unconditional modes. This example first shows a jump-to- 
subroutine at instruction location 52 where control Is transferred 
to location 90. At location 93, a conditional RETURN-FROM- 
SUBROUTINE instruction is performed. If the test is passed, the 
stack is accessed and the program will transfer to the next instruc- 
tion at address 53. If the test is failed, the next microinstruction at 
address 94 will be executed. The program will continue to ad- 
dress 97 where the subroutine is complete. To perform an un- 
conditional RETURN-FROM-SUBROUTINE, the conditional 
RETURN-FROM-SUBROUTINE instruction is executed uncon- 
ditionally; the microinstruction at address 97 is programmed to 
force HIGH, disabling the test and the forced Foon 


.gauses an unconditional return. 
Instruction 11 is the CONDITIONAL JUMP PIPELINE register 


address and POP stack instruction. This instruction provides 
another technique for loop termination and stack maintenance. 
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The example in Figure 5 shows a loop being performed from 
address 55 back to address 51. The instructions at locations 52, 
53 and 54 are all conditional JUMP and POP instructions. At 
address 42, if the TEST input is passed, a branch will be made to 
address 70 and the stack will be properly maintained via a POP. 
Should the test fail, the instruction at location 53 (the next sequen- 
tal mstruction) will be executed. Likewise, et address 53, either 
the instruction at 90 or 54 will be subsequently executed, respec- 
tive to the test being passed or failed. The instruction at 54 follows 
the same rules, going to either 80 or 55. An instruction sequence 


&s descnbed here, using the CONDITIONAL JUMP PIPELINE ° 


and POP instruction, is very useful when several inputs are being 
tested and the microprogsam is looping waiting for any of the 
inputs being tested to occur before proceeding to another se- 
quence of instructions. This provides the powerful: jump-table 
programming technique at the firmware level. 


Instructon 12 is the LOAD COUNTER AND CONTINUE instruc- - 


ton, which simply enables the counter to be loaded with the value 
at its parallel inputs. These inputs are normally connected to the 
Pipeline branch address field which (in the architecture being 
described here) serves to supply either a branch address or a 
counter value depending upon the microinstruction being exe- 
cuted. There are altogether three ways of loading the counter — 
the exphcrt load by this instruction 12; the conditional foad in- 
Cluded as part of instruction 4; and the use of the RLD input along 
with any instruction. The use of ALD with any instruction over- 
ndes any counting or decrementation specified in the instruction, 
Calling for a load instead. Its use provides additional microinstruc- 
ton power, at the expense ol one bit of microinstruction width. 
This instruction 12 ts exactly equivalent to the combination of 
instruction 14 and RUD LOW. Its purpose is to provide a simple 
Capability to load the register/counter in those implementations 
which do not provide microprogrammed control for ALD. 


Instruction 13 is the TEST END-OF-LOOP instruction, which 
Provides the capability of conditionally exiting a loop at the bot- 
tom; that is, this is a conditional instruction that will cause the 
microprogram to loop, via the file, if the test is failed else to 
contnue to the next sequential instruction. The example in Figure 
5 shows the TEST END-OF-LOOP microinstruction at address 
56. If the test fails, the microprogram will branch to address 52. 
Address 52 is on the stack because a PUSH instruction had been 
executed at address 51. If the test is passed at instruction 56, the 
loop is terminated and the next sequential microinstruction at 
address 57 is being executed, which also causes the stack to be 
POPd; thus, accomplishing the required stack maintenance. 


Instruction 14 is the CONTINUE instruction, which simply causes 
the microprogram counter to increment so that the next sequen- 
tal micromstruction is executed. This is the simp!est microinstruc- 
tion of all and should be the detault instruction which the firmware 
requests whenever there is nothing better to do. 


Instruction 15, THREE-WAY BRANCH, is the most complex. it 
provides for testing of both a data-depenrent condition and the 
counter dunng one microinstruction and provides for selecting 
among one of three microinstruction addresses as the next mi- 
croinstruction to be performed. Like instruction 8, a previous 
instruction will have loaded a count into the register/counter while 
pushing a microbranch address onto the stack. Instruction 15 
performs a decrement-and-branch-until-zero function similar to 
instruction 8. The next address is taken from the top of the stack 
until the count reaches zero; then the next address comes from 
the pipeline register. The above action continues as jong as the 
test condition fails. if at any execution of instruction 15 the test 
condition is passed, no branch is taken; the microprogram 
counter register furnishes the next address. When the loop is 


ended, either by the count becoming zero, or by passing the 
conditional test, the stack is POP’d by decrementing the stack 
pointer, since interest in the value contained at the top of the stack 
is then complete. 


The application of instruction 15 can enhance performance of a 
variety of machine-level instructions. For instance, (1) a memory 
search instruction to be terminated either by finding a desired 
memory content or by reaching the search limit; (2) variable- 
field-length arithmetic terminated early upon finding that the con- 
tent ol the portion of the field still unprocessed is all zeroes; (3) 
key search in a disc controller processing variable length records; 
(4) normalization of a floating point number. 


As one example, consider the case of a memory search instruc- 
tion. As shown in Figure 5, the instruction at microprogram ad- 
dress 63 can be Instruction 4 (PUSH), which will push the value 
64 onto the microprogram stack and load the number N, whichis 
one less than the number of memory locations to be searched 
before giving up. Location 64 contains a’ microinstruction which 
fetches the next operand from the memory area to be searched 
and compares it with the search key. Location 65 contains 4 
microinstruction which tests the result of the comparison and also 
is aTHREE-WAY BRANCH for microprogram control. Ifno match 
is found, the test fails and the microprogram goes back to location 
64 for the next operand address. When the count becomes zero, 
the microprogram branches to location 72, which does whatever 
is necesSary if no match is found. If a match occurs on any 
execution of the THREE-WAY BRANCH at location 65, control 
falls through to location 66 which handles this case. Whether the 
instruction ends by finding a match or not, the stack will have 


been POP'd once, removing the value 64 from the top of the 
stack. ’ 


Am29811A Instruction Set Difference 


The Am29811A instruction set is identical to the Am2910 except 
tor instruction number 15. in the Am29811A, instruction number 
15 is an unconditional JUMP’PIPELINE REGISTER instruction. 
This provides the ability to unconditionally branch to any address 
contained in the branch address field of the microprogram. Thus, 
an unconditional N-way branch can be performed. Use of this 
instruction as opposed to a forced conditional jump pipeline in- 
Struction simply allows the condition code multiplexer select field 
to be shared (formatted) with other functions. 


TYPICAL COMPUTER CONTROL UNIT ARCHITECTURE 
USING THE Am2910 


The microprogram memory contro! unit block diagram of Figure 6 
is easily implemented using the 4m2910. This architecture pro- 
vides a structured state machine design capable of executing 
many highly sophisticated next address control Instructions. 


The architecture of Figure 6 shows an instruction register capable 
of being toaded with a machine instruction word from the data 
bus. The op code portion of the Instruction is decoded using a 
mapping PROM to arrive at a starting address for the mi- 
Ccroinstruction sequence required to execute the machine Instruc- 
tion. When the microprogram memory address is to be the first 
microinstruction of the machine Instruction. sequence, the 
Am2910 next address contol selects the multiplexer D input and 
enables the three-state output from the mapping PROM. When 
the current microinstruction being executed is selecting the next 
microinstruction address as a JUMP function, the JUMP address 
will be available at the multiplexer D input. This is accomplished 
by having the Am2910 select the next address multiplexer D input . 
and also enabling the three-state output of the pipeline register 
branch address field. The register enable input to the Am2910 
can be grounded so that this register will load the value at the 
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Am2910 D input. The value at D is clocked into the Am2910's 
register (R) at the end of the current microcycle, which makes the 
D value of this microcycle available as the R value of the next 
microcycle. Thus, by using the branch address field of two se- 
Quential microinstructions, a conditional JUMP-TO-ONE-OF- 
TWO-SUBROUTINES or a conditional JUMP-TO-ONE-OF- 
TWO-BRANCH-ADDRESSES can be executed by either se- 
lecting the D input or the R input of the next address multiplexer. 


When sequencing through continuous microinstructions in mi- 
Croprogram memory, the program counter in the Am2910 Is used. 
Here, the control logic simply selects the PC input of the next 


address multiplexer. in addition, most of these instructions ena-. 


ble the three-state outputs of the pipeline register associated with 
the branch address field, which allows the register within the 
Am2910 to be loaded. The 5 x 12 stdck in the Am2910 is used for 
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fooping and subroutining in microprogram operations. Up to five 
levels of subroutines or loops can be nested. Also, ‘loops and 
subroutines can be intermixed as long as the five word depth of 
the stack is not exceeded. 


CCU TIMING 


The minimum clock cycle that can be used in a CCU design is 
usually determined by the component delays along the longest 
“pipeline-register-clock to logic to pipeline-register-clock” path. 


. At the beginning of any given clock cycle, data available at the 


output of the microprogram memory, counter status, and any 
other data and/or status fields, are latched into their associated 
pipeline registers. At this point, ali delay paths begin. Visual 
inspection will not always point out the longest signal delay path. 


Figure 6. A Typical Computer Controj Unit Using the Am2910. 
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The ob-rausly long paths are a good piace to start, but each 
detnab’e path shou'd be calculated on a component by compo- 
hent basis untl dhe truly longest logic signal path is found 


Relerning to Figure 6, a numter of potentially jong paths can be 
identified. These include the instruction reg:ster to pipeline regrs- 
ier time, the fipeline register to pipelne register time via the 
coadit.on code muitip!exer and the status to pipeline register hme. 
In order to demonstrate the technique for calculating the AC 
periormance of the Am2910 state machine design, the timing 
diagrams of Figure 7 are presented. Here, a number of propaga- 
ton de!ay paths are evaluated such that the reader can learn the 
techn.que for performing these computations. 


Alt of the propagation delays have been caiculated using typical 
propagation delays because atthe time of this writing, the charac- 
tenzation of the Am2910 has not been completed. When the final 
data sheet is pub!.shed, the user need only select (he appropriate 
worst case specilications and he can compute the desired 
max:mum propagation delays for his design. Also, by looking at 
the typical propagation delay numbers, the designer will be able 
to evaluate the design margin in the system after he has com- 
nleted ail of the worst case calculations. These typical prop- 

galion de:ays represent the expected values if a system were 
set up on the bench and actual measurements would be Laken al 
5V and 25°C operating tempereture. 


While Figure 6 and Figure 7 deal with the Am2910 microprogram 
sequencer, itts also instructive to evaluate the AC performance of 
@ typical computer contro! unit using the Am2911 and 
Am29811A Figure 3 shows such a connection and will be used 
as the basis for performing the propagation delay path calcula- 
tions. The calculatvons for the vanous propagation delay paths 


are demonstrated in Figure 8 and are intended to show the ' 


technique for computing these delays. As before, the typical 
propagation delays have been used in the computation for com- 
parison purposes. The user can derive the maximum numbers at 
25°C and 5V, commercial temperature range and power supply 
vanations or military temperature range and power supply varia- 
tions as required for his design. 


When Figure 7 and Figure 8 are reviewed in detail, the reader will 
recognize that the longest propagation delay paths in the case of 
the Am2910 as well as the Am2911 and Am29811A involve the 
three-state enables on the map PROM or the pipeline register for 
the branch address. If absolute maximum speed is desired, these 
paths can be eliminated by using one of several techniques. One 
techn:que is to simply allocate one or more bits in the pipeline 
register to control the three-state enables of the various devices 
connected to the D input of the Am2910. For the example of 
Figure 6, one bit would be sufficient and the pipeline register 
could be implemented using an Am74S175 register. This would 
allow the true and complement outputs to be used to drive the 
pipeline register branch address output enable and the mapping 
PROM output enable. Thus, these longest paths would be elimi- 
nated and an improvement of about 30ns would be achieved. A 
second technique for eliminating these propagation delay paths 
would be to use a four input NAND gate and a four input NOR gate 
to encode the equivalent function of the MAP enable and the PL 
enable. This technique is demonstrated in Figure 9. Again, an 
Am74S175 register would be used as the pipeline register to 
provide the Instruction inputs to the Am2910 sequencer. This 
would allow instruction 2 to be decoded to provide the MAP 
enable signal and “NOT INSTRUCTION 2" to be decoded as the 
pipeline enable signal. This technique can be applied as weil to 
the computer control unit of Figure 3 to accomplish the same 
longest path elimination. 
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Figure 7. Propagation Delay Calculations on the Am2910 Microprogram Sequencer. 
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Figure 7. Propagation Delay Calculations on the Am2910 Microprogram Sequencer (Cont). 
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Figure 7. Propagation Delay Calculations on the Am2910 Microprogram Sequencer (Cont.). 
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Figure 8. Propagation Delay Calculations for the Am2911 and Am29811A Design. 
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Figure 8. Propagation Delay Caiculations for the Am2911 and Am29811A Design (Cont.). 
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Figure 8. Propagation Delay Catculations for the Am2911 and Am29811A Design (Cont). 
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In order to compare the performance of the Am2910 with the 
Am2911 and Am29811A, Table 5 is presented. Here the prop- 
agation delays for the Am2911 and Am29811A are for a 12-bit 
wide microprogrum sequencer configuration. If a wider configu- 
ration is used, only one additional carry input to carry output delay 
must be added to the appropriate paths of these calculations. A 
12-bit wide Am2911/29811A configuration has been evaluated 
so that an “apples to apples" comparison can be made. 


Asis shown in Table 5, anumber of combinations are possible for 
the longest AC propagation delay paths for these microprogram 
sequencers. First, the continue instruction can be executed the 
fastest of any of the microprogram instructions if the continues 
are sequeniial. That is, from the second continue on, the typical 
microcycie can be either 61 or 64ns respectively. To achieve this 
speed, it is required that various signals throughout the architec- 
ture be stable such that the only paths that enter into the propaga- 
tion delay calculation are the clock-to-output of the microprogram 
counter, the microprogram memory and the pipeline register 
setup. 


The second group of instructions shown in Table § show some 
examples of instruction execution and jumping. These examples 
assume that the MAP and OE outputs are not used as described 
earlier. These calculations apply to several of the instructions but 
not to all the instructions. For the Am2910 sequencer all of the 
propagation delays are around 80 to 85ns; while for the 
Am2911/Am29811A combination, the propagation delays range 
from about 8Ons to 100ns, depending on the instruction. It should 
be noted that certain other instructions such as push and condi- 
tional load counter should be evaluated to determine the speed at 
which they can be executed. 


The last two instructions shown in Table 5 are for jumps where the 
output enable of the field supplying the address to the D inputs of 
the microprogram sequencers are controlled by either the 
Am2910 or Am29811A. Notice that for Am2910 configuration, the 
jump map represents the longest propagation delay path and is 
103ns typical. Also, for the Am2911/Am29811A combination, the 
jump map instruction also represents the longest propagation 
delay path and is 109ns typical. 


It is not the purpose of this exercise to show every possible 
propagation delay path; but rather, to show the reader the 
technique for computing propagation delays such that any design 
can be evaluated and the worst case past derived. Even here, not 
all of the worst case numbers shown in Table 5 have been derived 
in Figures 7 and 8. This was done intentionally and is lett as an 
exercise for the student. 


Ifthe Am2909 or Am2911 and the Am29811A are combined into 
microprogram sequencers of either 8 bits in width or 16 bits in 
width, the calculations need only be modified slightly to determine 


the microcycle times. Obviously, if two Am2911s are used, the 


worst case propagation delay paths do not change. However, if - 


four Am2911s are used, the carry path will become the longer 
Propagation delay path on several of the computations. This may 
be offset however since larger microprogram PROMS may be 
used if 64K of microcode is actua'ly being addressed or high 
power buffers may be placed between the Am2911 outputs and 
the microprogram memory to provide sufficient drive for such a 
large microprogram store. 


In addition, the Am2909 and Am2911 may be used without the 
Am29811A where the user wishes to generate a special purpose 
instruction set or very high speed control of the intemal multi- 
plexer and push pop stack. In some, designs as much as 25 to 
30ns, typical, can be removed from the longest propagation delay 
paths of the design by using high speed Schottky SSI. While this 
has not been the typical case, some designers have used it to 
provide a performance improvement not achievable with a stan- 
dard Schottky condition code multiplexer and the Am29811A 
next address control unit. 


APPLICATIONS 


It should be understood that the microprogram state machine 
built using either the Am2910 or the Am291 1/2981 1A represents 
a general purpose state machine controller. Applications for this 
type of microprogrammed control include uses in minicomputers, 
communications, instrumentation, controllers and peripherals as 
well as special purpose processors. Typically, the micropro- 
grammed approach provides a more structured organization to 
the design and allows the design engineer the greatest flexibility 
in implementation. 


It is important to understand that microprogrammed riachiiee 
need not be part of a typical minicomputer type structure. That is, 
a general purpose minicomputer usually has a machine instruc- 
tion set that is totally different from its microprogram instruction 
control. As such, it is essential thai the designer new to computer 


_ Gesign and microprogram design understand the difference be- 


tween a machine ‘nstruction and a microprogram instruction. This 
differentiation is shown in Figure 10 where a typical 16-bit 
machine level instruction is demonstrated as compared with a 
typical microprogram instructon. The machine level instruction 
uSually consists of 16 bits and in this example, these bits are used 
to provide the op code, source register definition and destination 
register definition. The microprogram instruction on the other 
hand usually consists of anywhere from 32 to 128 bits in a typical 
minicomputer type design. Here, the bits are used to control the 
elemental functions of a machine such as the Am2910 instruction 
control and condition code mullipiexer, the Am2903 source, ALU 
function and destination control and so forth. For purposes of this 
explanation, let us assume that the machine level instruction is 
available to the machine programmer while the microprogram 


TABLE 5. SUMMARY OF LONGEST AC PATHS FOR MICROPROGRAM SEQUENCERS. 
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Figure 10. Understanding Machine and Microprogram Instructions. 


instruction is not available to the machine programmer at the 
assembly language level. Letit suffice to say that this assumption 
§S not necessarily valid in machines being designed today. 


Perhaps one of the most typical applications of tha micropro- 
grammed computer control unit state machine design is as the 
controller for a minicomputer. Here, the function of the micropro- 
grammed controler is to fetch and execute machine level instruc- 
tons. The flow required to perform this function is depicted in 
F-qure 114 which should be representative for all general purpose 
«ype machines. Figure 11 shows that after initialization, the com- 
puter contyol unit simply fetches machine instructions, decodes 
these instructions and then fetches the required operands such 
that the original instruction can be executed. This cycle of fetch- 
ing and executing instructions is performed without end. Such 
things as hardware halts or resets are ignored and should be 
assumed to only cause re-initialization. 


Once the flow of a typical computer control unit is understood, itis 
possible to evaluate a number of architectures using the Am2910 


or Am2911/Am29811A such that the flow diagram of Figure 11 
can be implemented. 


STATE MACHINE ARCHITECTURES 


After a machine instruction is fetched from memory, it is normally 
placed in the machine instruction register as described in Figure 
6. Then the op code portion of the instruction is decoded so that a 
sequence of microinstructions in the microprogram memory can 
be selected for execution. Each microinstruction is fetched and its 


contents placed in the pipeline register as shown in Figure 6 for 
execubon. 


While the architecture of Figure 6 is recommended and has been 
used throughout the preceding portion of this chapter, it should be 
understood that a number of architectures are possible using 
these microprogram sequencers. The normal flow in fetching 
mucroinstructions is to determine the address of the next mi- 
croinstruction, fetch the contents at that address and set up this 
data at the input of the pipeline register such that it can be clocked 
into the pipeline register for execution. If we assume that a clock 
is being used to clock the pipeline register, the Am2910, the 
machine instruction register and the Am2903 microprocessor bit 
sbees. it is possible to define a number of computer control unit 
designs where the relabonship between the clock edges is dif- 
ferent. 


There seem to be a minimum of seven different architectures that 
can be defined based on placing registers in the appropriate 
signal paths and stoning data on the low-to-high transition of the 


Figure 11. Computer Control Flow Diagram. 


Clock. For purposes of this discussion, we will assume that all 
clocked devices will operate using the same clock such that 
changes will occur on the LOW-to-HIGH transition of the clock. 
While it is possible to use multiphase clocks and tie different clock 
phases to different devices, that type of system operation will not 
be described here. In all cases, we will be talking about the flow of 
signals between LOW-to-HIGH transitions of the clock. Typically, 
acycleis started by a clock edge at a device and the signals begin 
to flow from one device to the next untii a sel-up time to a clock 
edge resulis. Then, the next microinstruction is executed in 


exactly the same manner. There are three different identifiable 
types of microinstruction sequences where only one register is in 
the signal flow loop. The first of these we shall call an Address- 
Based microinstruction cycle. It usually starts with the address of 
a microprogram memory word being stored in a register by the 
clock. This address has been determined by the previous mi- 
croinstruction. This address then accesses the microprogram 
memory to fetch its contents which are presented at its outputs to 
control the Arithmetic Logic Unit and the results of the Arithmetic 
Logic Unit function may be used to determine the next address 
selected that will be stored in this microprogram address register. 
This is shown as Figure 12a. The second type of microprogram 
architecture is called Instruction-Based. Here, the register is 
piaced at the output of the microprogram memory as shown in 
Figure 12b. Again, the cycle consists of executing the mi- 
croinstruction in the ALU; perhaps using the results of the opera- 
tion to determine the address of the next microinstruction and 
then fetching the contents of that microinstruction and setting this 
new data up at the input to the register. The third basic architec- 
ture for microprogram control is called Data-Based. Here, arec::- 
ter is used to hold the status data from the ALU and this is the 
determining clock point for the cycle. Here, the status register 
iniwates the selection of the next address from which the micro- 
programmed data is fetched and this microprogram i’-struction is 
used to execute a new function in the ALU thereby setting up the 
results for the status register. This scheme is shown in Figure 
12c. Note that this scheme requires an additonal register at the 
output of the microprogram memory to hold a portion of the 
microprogram instruction for controlling the condition code mul- 
tiplexer and Am291Q instruction set. These primitive architec- 
tures fur microprogrammed control demonstrate the three points 
at which a register can be placed to provide a start and an end for 
the microcycle. In a general sense, each of these three architec- 


(a) Addressed Based 


tures is one level pipelined. This, however, is not the definition 
normally associated with pipelining of microprogram control. 


It combinations of the above described archrtectures are Im- 
plemented, an improvement in performance will be realized. In 
each of the three architectures thus described (address-based, 
instruction-based, and data-based), all of the signal paths are in 
series and must be transcended before a microcycle can be 
completed. They are quite easy to program, however, since ail of 
the tasks are completed in the loop before proceeding to the next 
microinstruction. As stated earlier, these tend to be the slowest of 
the possible architectures for microprogram control. This disad- 
vantage can be overcome by using a technique referred to as 
pipelining in microprogram control. In a pipeline architecture, we 
overlap the fetch of the next microinstruction while we are exe- 
cuting the current microinstruction. This is achieved by inserting 
additonal registers in the overail path such that we can hold the 
Signals step-by-step. There are three possible combinations of 
the above mentioned architectures that can he utilized in micro- 
program control. These are address-ins:-uction-based, 
address-data-based, and instruction-data-based. v/hile each of 
these represent two stages of pipelining, we normally refer to 
these as the pipelined architectures. These are shown in Figure 
12d, 12e and 12f. itis the instruction-data based architecture that 
is recommended for the Am2910 and prowdes the overall best 
trade-off in cost versus performance. = 


The last possible architecture using registers in the signal path is 
a combination of all three architectures and is called address- 
instruction-data-based microorogram control and 1s shown in 
Figure 12g. Here, three stages of pipeline are involved and we 
normally refer to this as two-level pipelined archiecture. Needless 
to say, if no pipelining were involved at all, we would have a ring 
oscillator. 


(b) Instruction Based 


Shaded Lines Show Required Signal Flow to Complete a Microcycls: 
Determine Address, Fetch Instruction and Execute. 
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(@) Instruction-Data Based (e) Instruction-Address Based 


Shaded Lines Show Required Signal Flow to Complete a Microcycle: 
Determine Address, Fetch Instruction and Execute. 


Figure 12. Standard Microprogram Control Architectures (Cont.). 


(f) Address-Data Based 


MAP cLock 


(g) Instruction-Address-Data Based 


map 


Shaded Lines Show Required Signal Flow to Complete a Microcycle: 
Determine Address, Fetch Instruction and Execute. 


Figure 12. Standard Microprogram Control Architectures (Cont.). 


The advantace of the instruction-data-based architecture is that 
the address and contents of the next microinstruction aro beng 
fetched while the cure: t microinsiruction in the pipeline register 
(Figure 6) is being executed. This allows a shorter microcycle 
since the microprogram memory ictch <.nd ALU execution can be 
operated in parallel The results “f this type operation ave dem- 
onstrated in Figure 13 where we see a typica' timing diagram of 
the microprogram executiun of the address-da!a-bascd :nstruc- 
tion architeciure. is snouid be noted tna’ when tre comtutarur al 
aspects 2! a microinstruction, are rot comp.eted ty ch. same 
microcycle. they obvicus.y cannct be used to determine the id- 
dress of another mic: ocycie until the computziion has beat ccm- 
pleted ard stored ti ces status reg.sier. Thus, this pipelined 
architec‘ure cHers s:g7:f can’ speed improve rent except in the 
Case O' certain cond'ton:.! »ymps Inc ther words, 112 ecnditona 
jump may rot .:se the status reg’sie’ 1::0:mation of te m- 
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mediately preceding microinstruction because the computation ‘s 
just being performed. For this architecture, the conditona! janp 
fetch musi be execu‘ed on the cycle after the stalus recitter 
contains the proper execution esults. This can be seen by utucy- 
ing Figure 13. In most micropiogram designs this .s no! a d :ud- 
vantage because o:-er house.eeping and ALU cperaticns can 
be performed whi'e the address of the next mucroinstruc' or ts 
be ng det 3rmined us.ng the current contents of the status evis- 
ter. While it is not drectiy pertinent to the discussion at ts ume, 
etus point out that the Am2°. 4 has bee des gned such :h1: ‘he 
machine arch.tec’ can u |.:e both instructon-data-base.. ar 

chitecture as well =s instr.cuon-based architec'ure {no ho se- 
keeping is required. Thus, the Am29+0 and Am2904 can be: sed 
-n avariab!e architec' ure cycle to ach.eve maximum -e-fo°m:nez 
fo the macine. 


rE"CH "cH 
obeth ig pe INSTI+, Ah 14 
aN he? wihS? +93 ; 
‘ 
FYE. "| EXECUTE EXEC 7 
aM t | piNST 102 we Mot 9 
mseoaer | aerur- RESU: 1 2+ | 
—e Peet ba | ' wee sTae2 


+g D.a rar of Mic cpica,. m Exec.t 


38 


QQ, Oz Q; G& Oy O% 


o 
STATUS REG 


Dy Dy, 0; 0; Dy Dy 0 


Figure 14. Typical Am2910 Microprogram Control Unit. 


The Am2910 in Computer Control 


A general state machine design using the Am2910 Is shown in ° 
Figure 14. Here, all three output enables are used to advantage in . 


order to control the mapping PROM, pipeline register and vector 
PROM in this design. This design Is very straightforward and in 
fact is identical to that shown earlier. 


One area that should not be overlooked is that of initializing the 
Am2910 at power up. One technique for accomplishing this is to 
use a pipeline register with a clear input to provide all LOWs to the 


* instruction inputs of the Am2910. This will cause a reset of the 


Stack in the’Am2910 and force the outputs to the zero word and 
microcode which can be used for the initialization routine. Typi- 
cally, power up will result in the firing of a timer which can be 
connected to the clear input of the register. Figure 15 shows the 
technique for initializing the Am2910 using this method. 


One advantage of the Am2909 when compared to either the 
Am2310 of Am2911 is the OR inputs to the microprogram ad- 
dress field. These QR inputs allow two, four, eight or 16-way 
branching for each device if proper control is used. This contro! 
can be accomplished using the Am29803A, 16-way branch con- 
trol unit A typical computer contro! unit using the Am29809, 
Am2911, Am29803A and Am29811A is shown in Figure 16. In 
this example, the feast significant microprogram control se- 
quencer is an Am2909 and the two more significant sequencers 
are Am2911s. 
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Figure 15. Initializing the Am2910. 
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Figure 16. A High Performance Microprogram Controller Using the Am2909, Am29811A and Am29803A. 


DETAILED DESCRIPTION OF THE Am2911 AND 
Am29811A IN A COMPUTER CONTROL UNIT 


The detailed connection diagram of a straight-forward computer 
control unit is shown in Figure 17. This design features all of the 
next address contro! functions described previously and a few 
features have also been added. 


“Referring to Figure 17, the instruction register consists of two 
Am25LS377 Eight-Bit Registers with Clock Enable. These reg- 
isters are designated as U1 and U2 and provide ability to selec- 
tively load a 16-bit instruction. This particular design assumes 
that the instruction word consists of an eight-bit op code as well as 
eight bits of other data. Therefore, the op code is decoded using 
three 256-word by 4-bit PROMs. The Am29761 has been 
selected for this function and is shown in Figure 17 as U3, U4 and 
US. 


The basic control function for the microprogram memory Is pro- 
vided by the Am2911s. In this design, three Am2911s (U6, U7, 


Note: Figures 17, 18, 20, and 24 are at back of the book. 


and U8) are used so that up to 4K words of microprogram mem- 
ory can be addressed. The microprogram memory can consist of 
PROMs, ROMs, or RAMs, depending on the particular design 
and the point of its development. This particular design shows the 
capability of a 64-bit microword; however, the actual number of 
bits used will vary from design to design. 


The pipeline register associated with the computer contro! unit 
cohsists of five integrated circuits designated U16, U17, U18, 
U19 and U20. ; 


One of the features of the architecture depicted in Figure 17 is the 
event counter shown as U9, U10 and U11. This event counter 
consists of three Am2SLS163s connected as a 12-bit counter. 
The counter can be parallel loaded with a 12-bit word from 
pipeline registers U18, U19 and U20. The multiplexer and D-type 
flip-flop (U21 and U22) at the counter overflow output (U9) is 
present to improve system cycle time and will be described in 
detail later. 
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This design also features a 16-input condition code multiplexer 
using two Am74S251s, which are designated U12 and U14. 
Condibon code polarity control capability has been added to the 
design by using an Am74S158 Two-Input Multiplexer designated 
as U13. The W outputs and Y outputs from U12 and U14 have 
been connected together but only one set of outputs will be 
enabled at a time via the three-state contro! signal designated as 
Rag and Ro. Since the Y output is inverting and the W output is 
-orrinverting, the two-input multiplexer, U13, can be used to 

et the test condition as either inverting or non-inverting. This 
allows the test input on the Am29811A Next Address Control 
Unit, U15, to execute conditional instructions on either the in- 
verted or non-inverted polarity of the test signal. For example, a 
CONDITIONAL BRANCH may be performed on either carry set 
or Carty reset. Likewise, the same CONDITIONAL BRANCH 
might be performed on either the sign bit as a logic one or the sign 
bit as a logic zero. Note that the Am29811A Next Address Control 
Unit has eight outputs. Four outputs to contro! the Am2911's So, 
S,, PUP and FE inputs. Two outputs to contro! the three-state 
enables of the devices connected to the D inputs, i.e., a map 
enable (MAP E) to select the mapping PROMs and a pipeline 
enable (PL E) to enable the three-state Am2918 outputs which 
make up a 12-bit wide branch address field. The remaining two 
Am29811A outputs are for loading and enabling the Am25LS163 
counters. CNT ENABLE from the Am2981 1A is active-LOW while 
the Am25LS163 counter requires an active-HIGH enable, there- 
fore CNT ENABLE from the Am29811A is passed through one 
section of the Two-input Multiplexer (U13) for inversion. An alter- 
Native counter, the Am25LS169, has enable as active-LOW; 
therefore, this inversion through U13 is not required. 


At this point, a discussion of the typical operation of this computer 
control unit is in order. First, bits 0-11 of the microprogram mem- 
ory output word, are connected to the pipeline register desig- 
nated U18, U19 and U20. The Am2918 has been selected for this 
portion of the pipeline register because of its continuous outputs 
and three-state outputs. The three-state oulpuls are connected to 
the D inputs of the Am2911 to provide a branch address 
whenever needed. These 12 bits are designated BRy-BR,4. The 
Q outputs of these same Am2918s are designated Ro-R,, and 
are connected to the parallel load input of the Am25LS163 
Counters. Thus, the counter can be loaded with any value be- 
tween 0 and 4,095. Many designs will take advantage of Ro-Ry, 
and use it as a general purpose field whenever the counter is not 
being loaded or a jump pipeline is not being performed. Using a 
microprogram memory field for more than one function (branch 
address and counter load value in this example} is called FOR- 
MATTING and will be covered in greater detail later. The other 
two devices in the p'peline register shown on the architecture of 
Figure 17 are U16 and U17. First, U17 receives four bits (12, 13, 
14 and 15) from the microprogram memory to provide four-bit 
instruction field to the Am29811A. This four-bil fie!d, designated 
Ry2-Ry5. provides the actual next address contro! instruction for 
the computer control unit. Rig is the polarity control bit for the test 
input and is connected to the select input of the Am74S158 
Two-Input Multiplexer. When Ry¢ is LOW, the signal at the 
Am29817A test input will be inverted, but when Rix is HIGH, the 
test input will be non-inverted. 


The Am74S175 has been used as part of the pipeline register 
{U16) because it has both inverung and non-inverting outputs 
Signa!s Ry7, Rig end Ajg are used to contro! the One-of-Eignt 
Multip'oxer (U12 and 114) A, B and C inputs. Pipeline register 
Output Rzg and A, are used to enable either the U12 outputs or 
the U*4 outpuis such that a one-ol-sixteen multip'exer funclion is 
‘moiamenied. In this design, the TEST 0 input of U14 is con- 
ct ted to ground. This provides a convenient path for converting 


Note Figures 17. ‘8 20 and 24 are al Lack ol the Dock 


any of the conditional instructions to non-conditional instructions. 
That is, any of the conditional instructions can be executed un- 
Conditionally by selecting the TEST 0 input which is connected to 
ground and forcing the polarity control to either the inverting or 


non-inverting condition. This allows the execution of uncondi- 
tional JUMP, unconditional JUMP-TO-SUBROUTINE, and un- 
conditional RETURN-FROM-SUBROUTINE instructions. 


Bit 21 from the microprogram memory utilizes a flip-flop in U17 as 


part of the pipeline register. This output, Ro;, is ysed as the 
enable input to the instruction register. Needless to say, other 


techniques for encoding this enable function in a formatted field 


could be provided. . 


A HIGH PERFORMANCE COMPUTER CONTROL UNIT 
USING THE Am2909 AND Am29803A 


The high perferm:..ce CCU (Figure 18) is of a similar basic 
design as the cu. usly described CCU. The major differences 
are, referrinc ‘c Figure 18, the addition of an extended enabie 
control (Utuy, a vector input (U24 and U25), and an Am29803A 
16-way Branch Control Unit (U23). These performance en- 
hancements are more related to function than to actual circuit 
speed. The use of these enhancements by the microprogram 
provides greater flexibility in controlling a machine's environ 
ment, and can reduce the microinstruction count required to 
perform a particular task, which has the effect of increasing 
overall system throughput. 


In describing this high performance CCU design. those sections 
which remain unchanged from the previous descnption (Figure 
17), will mot be covered again. This includes the mapping 
PROMs, sequencer, Am29811A, counter, condition test inputs 
and associated polarity control, and the pipeline register. The 
areas that will be covered are: extended enable control (U16). 
Vector inputs (U24 and U25), and the Am29803A 16-way Branch 
Control Unit (U23). 


Extended Enabte Control 


Extended enable control is accomplished via an Am74S139 dua! 
two-to-four line decoder in conjunction with the Am29811A next 
address contro! unit. In Figure 17, PL E and MAP E of the 
Am29811A were connected directly to the components that they 
are to contro! (pipeline registers and mapping PROMs, respec- 
tively). Likewise, CNT LOAD and CNT ENABLE are connected 
directly to the counters that they control (with the exception that 
CNT ENABLE requires inversion when using Am25LS163 
counters). In Figure 18, PLE, MAP E, CNT LOAD and CNT 

'NABLE go to the inputs of the Am74S139 two-to-four line de- 
coder (U16). When either PL E or MAP E is LOW, then either 2Y; 
or 2Y2 of U16 is LOW and either the pipeline branch address 
registers or mapping PROMS are enabled. If both PL E and MAP 
E are HIGH, then output 2Y3 of U16 is LOW enabling the three- 
State outputs of U24 and U25 which are altemate oe. ine 
Starling address decoders (alternate mapping PROMs), an 
called VECTOR INPUT in this design. Likewise, CNT LOAD and 
CNT ENABLE follow the same rules, enabling the counter to load 
or count via 1Y,; and 1Y2 of U16. 


Vector Input 


The "Vector Input" provides the system designer with a powerlul 
next starting address control. For example, one possible us*® 
might be as an interrupt vector. For instance, use the “Interrupt 
Request" output of an Am2914 Vectored Prionty Interrupt Con- 
troller (or group of Am2914s) as an input to one of the conditional 
test inputs of multiplexers (U12 or U14). Then connect the 
Am2914 Vector Out lines to the vector mapping PROMs (Vector 
input U24 and U25). The microprogram then could, at the appro- 


priale time, test for a pending interrupt and If present, jump in 
microprogram memory directly to the routine which handles the 
specific interrupt as requested via the Am2914 Vector Output 
ines. This routine will take the proper steps to preserve the status 
of the interrupt system, and then will service the interrupt. This is 
one of many possible uses for the Vector Input. Other possible 
uses include both hardware and software “TRAP” routines and 
so forth. As can be seen, the design presented here uses the 
Vector Enable line (output 2Y3 or U16) to enable an alternate 
Starting address input at the Am2911. This, however, does not 
preclude the use of other devices in place of mapping PROMs as 
the D-input vector source. 


it should be understood that this does not accomplish a “micro- 
interrupt” function in that it 1s not a random possibility. Instead a 
microprogrammed test is made and an altemate microroutine is 
performed. A true “microprogram interrupt" is one that could 


occur at any microinstruction. The Am2910 does not handie this - 


case internally. 


Am29803A 16-Way Branch Control Unit 


The Am29803A provides 16-way branch control when used in 
conjunction with the Am2909 bipolar microprocessor sequencer, 
and is shown as U23 in Figure 18 with its pipeline register U22. 
The Am29803A has four TEST-inputs, four INSTRUCTION- 
inputs, four OR-outputs, and an enable control. The four OR- 
outputs connect directly to the Am2909 OR- inputs (U8 in Figure 
18). The four INSTRUCTION-inputs to the Am29803A provide 
contol over the TEST-inputs and OR-outputs, and are provided 
by the microprogram via the pipeline register U22 (Figure 18). 


Basicalty, the INSTRUCTION-inputs (I9-l3) provide sixteen in- 
structions (0-F 1.) which can select sixteen possible combinations 
of the TEST-inputs and provide a specific output on the OR- 
outputs depending upon the state of the inputs being tested. (The 
subsenpt 16 refers to basic 16.) All possible combinations of 
instruction-inputs, TEST-inputs and OR-outputs are shown in 
Figure 19. 


Note that instruction zero does not test any inputs (a disable 
instruction). Instructions 1, 2, 4 and 8 test one input and can 
Cause a branch to one of two words. Instructions 3, 5, 6, 9, 10 and 
12 test two inputs and can jump to one of four words (a 4-word 
page). Instructions 7, 11, 13 and 14 test three inputs and can 
jump on an eight word page. Instruction number 15 tests all tour 
inputs and the result can jump to any word on a sixteen word 
page. 


USING THE Am29803A 


In the architecture of Figure 18, the Am29803A allows 2-way, 
4-way, 8-way or 16-way branching as determined by selectable 
combinations of the TEST-inputs. Referring to Figure 19, the 
ZEAO instruction (all instruction bits LOW) inhibits the testing of 
any TEST-inputs, thus providing LOW OR-outputs. Any single 
TEST-input selected (To, T;, Tz or T3) will result in OR being 
HIGH or LOW in correspondence with the polarity of the selected 
TEST-input. Selecting any combination of two TEST inputs re- 
sults in the outputs OR and/or OR, being HIGH or LOW, fotlow- 
ing a mapped one-to-one relationship, i.e., ORg and OR, will 
follow the TEST-inputs, but no matter which pair of TEST-inputs 
are selected, their HIGH/LOW condition is mapped to the ORo 
and OR; outputs. Likewise, selecting any three TEST inputs, will 
map their HIGH/LOW condition to the ORp, OR, and OR, out- 
puts. Selecting all four TEST-inputs, of course, causes a one-to- 
one relationship to exist between the HIGH/LOW conditions of 
the TEST-inputs and the corresponding OR-outputs. Refer to 
Figure 19 to verity the relationships between INSTRUCTION- 
inputs, TEST-input, and OR-output. It is very important that the 
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mapping relationship between these signals be completely un- 
derstood. When using the Am28803A TEST-OR capability as 
shown in Figure 18, the microprogrammer must position the 
applicable microcode within microprogram memory so that the 
low-order address bits are available for ORing. Sequencer in- 
structions using the Am2909/2911 D-inputs (JAP, JSAP, JP and 
CJS in particular) are ideally suited for the Am29803A TEST-OR 
capability. The jump-to-location, available via pipeline BRo-BR:, 
or the Am2909/291 1 register, can contain the address of a branch 
table. A branch table is merely a sequential senes of uncondi- 
tional jump instructions. The particular jump instruction executed 
is determined by the low-order address bits; that is, the first jump 
instruction in a branch table must start at a location in micropro- 
gram memory whose low-order address bit (or bits) is zero. Ifa 
single Am29803A TEST-input is selected (2-way branching) then 
only the least significant hit in the beginning branch table address 
needs to be zero. Two Am29803A TEST-inputs selected (4-way 
branching) requires that the branch table start on an address with 
the low-order two bits equal to zero; 8-way branching requires 
three low-order zero bits, and 16-way branching requires four 
low-order zero address bits. Understanding this branch controi 
concept is realty quite simpie. The branch table is located in 
microprogram memory beginning at a location whose address 
has sufficient low-order zero bits to accommodate the number of 
selected Am29803A TEST-inputs. If, for instance, three TEST- 
inputs were selected, the first jump instruction in the branch table 
must be at an address whose low-order three bits are zero, such 
as address OF81,. The second jump instruction in the branch 
table would begin in microprogram memory address OF 9;,. The 
third jump at location OFA4g, the fourth at OF5,¢, etc. Through all 
eight locations (OF 8,,¢-0FF,_¢). Assume the following pipeline in- 
struction (referring to Figure 18): (1) U22 selects three 
Am29803A TEST-inputs, (2) U18 instructs the Am29811A Next 
Address Controller to select the Am2909/2911 D-inputs. (3) U16 
enables the pipeline branch address as the D source, and (4) 
U19, U20 and U21 supplies the address OF8;_ as the branch 
address. The Am29803A TEST-inputs will be ORed into the © 
low-order three bit positions, thus providing a jump entry into the 
branch table indexed by the value of the OR bits. Each instruction 
in the branch table is usually a jump instruction, which allows the 
selection of a particular microcode routine determined by the 
value presented at the Am29803A TEST-inputs. These jump 
instructions are the first instruction of the particular sequence. 
There are, of course, many other ways to use the Am29803A 
16-way Branch Control Unit. 


The microprogram memory address suppliea via an Am2909 
sequencer can be modified by the Am29803A 16-way Branch 


. Control Unit. Remember, however, that the microcode as- 


sociated with this address modification relies on certain address 
bits being zero, therefore this microcode is not arbitrarily relo- 
catable. The above discussion describes using the D-input and 
branching to provide low-order zeroes to use the OR inputs. 
Through proper design, the Register, PC Counter, or File can be 
used equally well. 


THE COMPLETE COMPUTER CONTROL UNIT 
USING THE Am2910 


A detailed connection diagram for a straightforward computer 
control unit using the Am2910 is shown in Figure 20. This design 
utilizes the Am25LS377 as U1 and U2 to implement a 16-bit 
instruction register. The op code outputs from the instruction 
register drive three Am29761 PROMs to perform the op code 
decoding function. These are shown in the diagram of Figure 20 
as U3, U4 and US. The Am2910 sequencer (U6) is used to 
perform the basic microprogram sequencing function. 
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Figure 18. High Performance Computer Control Unit with Am2909/2911. 
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Figure 19. Function Table. 


A 16 input condition code multiplexer function is provided by 
using two Am2922s as U7 and UBS. These devices allow one of 
Sixteen inputs to be tested and the polarity of the test can also be 
determined. The pipeline register consists of U9, U10, U11, U12 
and U13. These devices are edge triggered D type registers and 
have been selected to provide unique functions as required de- 
pending on their bit positions in the pipeline register. An 
Am74S175 was selected for U9 because both a true and com- 
plement output were desired to provide control to the condition 
code multiplexer three state enables. An Am74S174 register was 
selected as U10 because it provides a clear input for initializing 
the Am2910 microprogram sequencer. Three Am2918s were 
selected for U11, U12 and U13 because they have a three state 
output that can be used to provide the branch address field to the 
Dinputs of the Am2910 and they also have a set of outputs that 
can be used to provide other control signais via this field when it 
does not contain a branch address. No specific devices are 
Shown for the microprogram memory as the user should select 
the desired width and depth depending on his design. 


ANOTHER DESIGN EXAMPLE 


The Am2909, Am2910, Am2911, Am29811A and Am29803A 
have been designed to operate in the microprogram sequencing 
section of any digital state machine. Typically, the examples 
shown are for performing the computer control unit function of a 
typical minicomputer class machine. The design engineer should 
Not limit his thinking for the use of these devices simply to that of 


Mmicroprogram sequencing in a computer control unit. These de- " 


vices can be successfully used in other areas of designing such 
aS memory control, DMA control, interrupt control and special 
Purpose microprogrammed machine architectures. In order to 
Provide an example of a design using these devices in something 
other than a typical computer control unit, a microprogrammed 
CRT. controller is described in the following. 


inorder to provide some basis for the design of a CRT controller, 
the requirements of this controller must be ree out. These are 
given as {dllows: 


A) Character size: 5 x 7 dot matrix. The character field will be 7 
dots by 10 horizontal lines thereby providing ample space for 
the 5 x 7 character and the intervening space between 
characters and lines of characters. 


B) 80 characters per line. A standard 80 character per line dis- . 


play will be utilized and there will be 18 character periods 
allowed for horizontal retrace time. 

C) 24 lines of characters per frame. This provides a total of 240 
visible lines per frame (24 lines of characters by 10 horizontal 
lines per character). There are a total of 24 lines provided for 
easel tetrace bringing the total umber, of lines per frame to 

B) Refresh rate: 60 frames per second. Therefore, the horizoy- 
tal line rate will be 264 x 60 = 15,640Hz. As there are atotalof 
80 + 18 = 98 character periods in a line, the character rate 
will be 98 x 15.84 = 1,552.32KHz, and the dot rate will be 7 x 
1.5288 = 10.86624MHz. (Note: No interlace is used.) 


E) It Is assumed that there is a 2K word deep « 8-bit wide © 


character RAM available to the host computer in which it can 
write the ASCH equivalent of the characters to be displayed. If 
scrolling is to be used, the host computer must also write the 
first visible character's address divided by 16;9 into ie 
Am25LS374 “First Address Register’. 

F) This CRT controller must generate an 11-bit.character ad: 
dress that is used by the 2K word deep character RAM. It must 
also generate the required video enable signals and the hori- 
zontal and vertical blanking signals. 


. . 7 


Principle of Operation 


A detailed block diagram of the CRT controller is shown in Figure 
21. The block diagram shows an interface to an SBC-80/10 data 
bus, address bus and control bus. The outputs of the CRT control- 
ler are connected to a CRT monitor on the block diagram. Other- 
wise the block diagram shows a straightforward use of the 
Am2910 and three Am291 1s to implement the CRT control func- 
tion using microprogrammed techniques. The SBC-80/10 was 
selected for this example since it is well known. 


Alogic diagram of the CRT controller is shown in Figure 22. Three 
Am29775 512-word x 8-bit registered PROMs are used to contain 
the 23-bit wide microprogram. While only a minimum number of 
words are used in the design as shown, many additional words 
can be used to add various options (as described later). The 
address for these Am29775 registered PROMs is provided by an 
Am2910 microprogram sequencer. Three Am2911 sequencers 
are used to generate the character address for the character 
RAM. The least significant Am2911 sequencer is connected as a 


. divide by 16 counter. This RAM address is compared with the 


desired last character address (80 x 24 = 1920) value using an 
Am25LS2521 8-bit equal to detector. When the last address is 
detected, it can be sensed at the condition code multiplexer 
(Am25L$153) that is used to select the condition code for the 
Am2910 sequencer. 


The data derived from the 2K word character RAM is decoded by 
a character generator (6061) in this design and the character 
output is parallel loaded into an Am25LS23 shift register. This 
shift register is used to provide the video signal from its Qy output 
to eventually drive the display via an Am74S240 butfer. The 


diagram of Figure 22 depicts an oscillator input source to supply: 
the dot frequency. In this design, a 10.86624MHz oscillator — 


should be connected to this oscillator input point. This oscillator 
input signal is used to clock the shift register containing the 
individual dot bits (dot-on or dot-off) and also drives an 
Am25LS169 counter which divides this frequency by 7 to gener- 
ate the character rate clock. This character rate clock is used 
throughout the controller to provide a timing signal for the state 
machine design. 


An Am25LS168 decade counter is used to generate the line 
inputs for the character generator and to count 10 horizontal lines 
per character space. This counter Is clocked by the horizontal 
blanking signat (HB) and its RCO output is used as one of the 
condition code multiplexer inputs. The RCO output can be tested 
to determine when 10 counts have been executed by the counter 
and it is also used to enable the last address comparator during 
the 10th horizontal line time. . : - 


When the host computer accesses the character RAM, the 
HOST-ACCESS Iine is pulled LOW. This removes the Am2911 
outputs from the character RAM address bus. When this access 
occurs, improper data may be present at the shift register inputs. 
Thus, the character generator PROM output is disabied by the 
HOST-ACCESS signal during this time. 


Wtien power is applied to this CRT controller or whenever it is 
reset, the RESET line is driven LOW. This signal is inverted 
through an Am25LS240 and then disables a part of the pipeline 
register outputs as well as enabling one half of an Am25LS241. 
This Am25LS$241 inserts LOWs onto the instruction (I) inputs of 
the Am2910 sequencer. Then, the next character rate clock will 
force the microprogram address outputs to zero and the micro- 
program for the CRT contoiler as shown in Figure 23 will be 
executed starting at address zero. 
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Computer Control Unit with Am2910. 
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Figure 21. CRT Controller Block Diagram. 


The Microprogram for the CRT Controller 


Table 6 shows a complete description of the microprogrammed 
CRT controlier microcode. Execution of these microinstructions 
ts controlled by the Am2910 sequencer, 


As can be seen in Table 6, several techniques were used in this 
short microprogram to provide the different counting require- 
ments of this CRT controller. Although only one format (60 
characters per line, 24 lines per frame) was shown here, the 
designer can easily configure his own format by simply changing 
some constants in the microprogram. As an exercise, the reader 
ts encouraged to find a means to program the CRT controller for 
diferent formats. The host computer software could configure the 
controller format by using an additional register similar to the 
“First Address Register’. This will be discussed In an appendix at 
the end of this chapter. 


A complete wiring diagram for the microprogrammed CRT con- 
troller is shown in Figure 24. This can be used directly with the 
interface shown in Appendix A such that the CRT controller can 


Note’ Figure 24 is at back of the book. 


be connected directly to an Am9080AK based microprocessor 
system. Appendix A also depicts the use of a 2K word x 8 b& 
character RAM as described previously. 


CRT Controller Timing Considerations 


As was discussed earlier, the character clock frequency for the 
CRT conwolier is 1,552.32KHz. Thus, itis desirable to calculate 
the longest path of the design to ensure that none exceed ths 
clock period of 644.1ns. The timing diagrams of Figure 25 depict 
anumber of ditferent paths with the associated propagation delay 
catculations. 


When all of the timing diagrams of Figure 25 are examined, it wil 
be found that only three show propagation delay times of over 
200ns typical. Of these, the worst case is 318ns as shown in 
Figure 25(i). Since the requirement of the design is to insure that 
none exceed 644,1ns, we have more than a 2 to 1 marginin the 
design based on the typicals. Thus, we can see that the design 
will operate properly even over the full military temperature range 
and power supply variations based on this analysis. 
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Figure 24. CRT Controller. 
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Figure 23. Microprogram for the CRT Controller. . 


TABLE 6, DESCRIPTION OF THE MICROPROGRAM FOR THE CRT CONTROLLER. 


Is LOW 80 that the micro- 
Program incrementer wall not 
rcrement. 


ZERO and C,, are stl LOW, 
80 NO change in thus device. 


Mote: Figure 24 ls at beck of the book. 


High 
Order Am29118 


Both S, and So are HIGH to 
that the D inputs will be routed 
to the Y outputs. These inputs 
wil come from the First Address 
Register (the Am2910 VECT is 
LOW). Cy, is LOW (see fief 
cotumn); therefore the micro- 
Program counter wail not incre- 
ment. FE ts LOW (and PUP fa 
always HIGH) causing the pre- 
sen! output to be pushed on 
the stack. The character ad- 
Gress is already the “First 
Character Address™. 

S; and S> are LOW; thus, the 
Y outputs wil be the current 
PC, (the same as the Y out- 
puts were in the previous step). 
C,, ls st LOW, therefora no 


The CJV instruction is selected. 
Therefore, VECT output will be 
LOW, enabling the “First Ad- 
Gress Register onto the intemal 
B-bet bus. CCEN is LOW: the 
MUX ls selecting a constant 
HIGH, and the sequencer will 
address the nexi consecutive 


microprogram address (word 1). 


LDCT Is selected and the nurn-” 


ber of character-rows per frame 
manus 1 (2319) is loaded into 
the Am2910 register/counter. 
The sequencer addresses the 
Next microinstructon. 


Tha Am2910 will generate the 
fexi microprogram address. 


icant bits onto the Am2911's file, 
and continues to the next micro- 


instruction. 


This is the starting location for 
the main loop. 


TABLE 6. DESCRIPTION OF THE MICROPROGRAM FOR THE CRT CONTROLLER (Cont). 


Micro- 
program Low High . 
Address Order Am2911 Order Am29118 Am2910 
3 This Am2911 now counts up inutially these two Am2911s With the MUX selectng the 


using nts PC meromenter. At 
the final count (moving from 
Fy9 to 0) ts Cay output wil 
be HIGH. 


will not change theit Y outputs 
since the C,, input is LOW. 
However, when the.C,, input 
goes HIGH, the intemal PC 
wall incramant 


Cr+4 Output from the least 
significant Am2911 sce, the 
CC inpet to the Am2910 se 
quences will be LOW unt tha 
Am2911 counts 16. CC = 
LOW wall cause the next mi- 
croprogram address to be the 
pipebne register contents; this 
is also the current micropro- 
gram address (word 3). When 
Ch+4 goes HIGH, CC wil 
HIGH and together with 

© LOW. wil force the Am2310 
to address the next consecu- 
tive mucroprogram address (4). 


Same as 3, except that at each 


Tha MUX selects the 
Am25LS 168 ten-bne-counters 
ACD as the cond.ton coda mn- 
put to the Am2910 (CC). if the 
ine count is fess than 10, CC 
wil be HIGH and the next m+ 
croinstructon wil be addressed. 
Uf the tenth Ene of a character 
row is executed, CC wil be 
LOW and a JUMP-TO-SUB- ‘ 
ROUTINE to an address, sdp- 
phed by the papeline regsster 
(TENTH") will be exegsted. 
The MUX now selects the Last 
Address Comparator output for 
GE. it the currentInore Sugral- 
icant bets of the character- 
address coincide with the last 
address + 1 (192010/16) a 
subroutine call wil be per- 
formed to "LASTA™. Other- 
wise, the microprogram will 
continue consecutively. 


Same as at address 3. 


Continues to count (note that it 
enters this ine with an output 
ot zero). 


Since Cy, is LOW (see leh 
column) no change occurs in 
these devices. Note thal the 
Y outputs contain the more 
significant bits of the address 
of the first character of the 
next character row. 


Continues to count through 
tha internal PC incrementer. 


No change until C, goes 
HIGH, then count 


Continues to count. At count 
15, Cr44 goes HIGH. 


Unconditionally (CCEN = 
HIGH) steers tha micropro- 
gram to the address supphed 


ft doesn’! matter what this 
device does at this microstep 
becayse at the next micro- 

Step A will rececve LOW on by the pmelina register 
ts input. (‘MAIN = 2). 


Continues 10 count. H intema! counter is equal to 
zero, it means that 24 character 
tows were already displayed 
and we are al the bottom of the 
CRT cssplay. A vertcal retrace 
period is needed and the mi- 
croprogram will contnue 
sequentally. If the counter is 
nol yet zero, we do not need 
to execute the vertical retrace 
routine and the next address 
will be supplied by the pmpe- 
regislar ("GOBACK™ = 
1448) while the intemal 
counter is decremented. | 


Comments 


Ths microsiep will be executed 


16 times. (Note thal 60 = 5 x 16.) 


The microprogram itsett is used 
8S 8 counter in this appiica- 
bon since the count only 5, 
the mucroprogram is relatvely 
shorn versus the memory's depth 
and thus is a convenient maans 
to economize on chp count. 


We are now’ at the end of a TV 


line. Therefore, the Honzontal 


Blanking Signa! (HB) is HIGH 
The least sigrufcant Am2911 
sea now counts the invisible 
characters during the horizon- 
tal retrace. 


Note that 80 characters/row 


and 24 rows/trama requires & 


1920, 9 word memory. When 
the last mamory tocahon 
(19209) & read out, the scan will 


begin a! 0. 


Waiting for the least significant 


Am2911 to count fo 15. This 
microstep wall be executed 
as many tes as necessary 
to accomplish this. 


Performing a JUMP to the 
beginrang of the main-ioop - 
(address 2). 


The decision whether the botiom 
of the CAT (End of Frame) is 

reached or not is made ntemaly 
in the Am2910, using its counlar. 


55 


56 


TABLE 6, DESCRIPTION OF THE MICROPROGRAM FOR THE CRT CONTROLLER (Cont). 


Low HIgh 

Address Order Am2911 Order Am2S1t18 Am2910 
ZERO « LOW, there!sre, out- 
it Y = O. Thus cs necessary 
‘WO assure that Cyyg cs LOW. 


Comments 


AS wo ore al the End of Frame, 
the “First-Address-Register" 
contents (enabled by the 
Am2910's VECT output) Is 
pushed onto the Am291 1's fils. 
Note that the Vartical Blaniung 
Signal (VB) goes HIGH. 


(14619 + 1) x 1649 @ 235219 
equals the number of character. 
penods dunng vertical retrace. 
Load.ng 2352) directly into the 
Am2910's counter would require 
7 bits. Using scheme we 
reduce the mcroprogram width. 


‘Same as st address 0. 


Same as at address 0. 


The intemal counter is loaded 
with 14619. Supplied by the 
Ppelne register. The nex? 
consecutive microstep 


This is a proparatory step tor the 
2 step “Vervcal Ratrace” doubie- 
nested loop. 


HIGH (supplied from a con- 
stant HIGH by the MUX), the 
Next address (101,_) will ba 
pushed onto the Am2910 lile, 
the counter will not be af- 
fected and the next consecutve 
Microstep wil be addressed. 


The MUX supples tha Cy 44 
outpul of the tess significant 
Am2911 shes to the Am2910 
(eo input. While this signal is 
low, the Am2910 will select 
the pipelina register as ihe 
source of the next microin- 
structon address. The current 
address (10,,) beng waitten 
here, tus mstruction wil be exe- 
cuted unul CE goes HIGH. Then 
the nexi consecutive iastruc- 
tion wall ba selected through 

the Am2910 internal PC. 


I the final count has been 
feached, the next micro 
instructon wil be addrassed 
and the internal slack will ba 
popped (adjusted). Otherwse, 
the naxt mucroinstructon ad- 
dress will ba the one residing 
on the lop al the stack (whch 
Is 104). 


104 Counts. When fina! count 1s 


No change with C, = LOW; 
teached, Cyy4 © HIGH. 


increments with C, = HIGH. 
This has no prachcal atlect as 
the HB signal is HIGH, and at 
tho beg:nnung of the next wis- 
ible hne, the correct addrass 
will ba fetched from the ide 
(address 2). 


Again, this is a possible way to 
dwell on a cortain mucrosiep 
waiting a condition ta change 

its status (like address 3 through 
7). This ts the intorna!l loop of 

4 double-nasted loop system. 


This is the extamal loop of the 
double-nasted loop system, 
which counts the vertical retrace 
intarval. By adding a sing'e m- 
cromstruction tha chip count 
was teduced. 


Same as al address 1. Reinitializes tha Am2910 intemal 
counter with the number of 
character rows per frame. 


End of “TENTH” subroutine at 
End of Frama (with vertical 
felraca). 


Unconditional retum from End of “TENTH™ subroutine 
subroutine. without vertical retrace. 
$$$ 
Pushes zero into hile. Unconditional retum from A one-line subroutine to reink 
- Bubroutna. tialize character address 10 zero. 


Unconditional return trom 
subroutine. (CCEN = HIGH). 


— 
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Figure 25. (Cont.) 


Figure 25. (Cont) 
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Figure 25. (Cont) 


SUMMARY 


The Am2910 provides a powerful solution to the microprogram 
memory sequence control problem. The Am2910 is a fixed in- 
Struction set, 12-bit wide microprogram sequencer. In addition, 
the Am2909, Am2911, Am29811A and Am29803A provide 
another solution to the microprogram sequencing problem. 
These devices are bit slice oriented and provide more potential 
fiexibility to the microprogram sequencing solution. All of these 
devices are particularly well suited for the high performance 
computer control unit and structured state machine designs using 
overlap fetch of the next microinstruction — also referred to as 
instruction-data-based microprogram architecture. 


These Am2900 family microprogram control devices offer the 
highest performance LSI solution to the problem of microprogram 
control. They provide a number of conditional-branch source 
addresses as well as conditional jump-to-subroutine and 
conditional-retum instructions. In addition, several techniques for 
timed and untimed looping are provided such that loops from one 
to several microinstructions can be executed. All of the devices 
described in this chapter are competitively priced and currently 
available. In addition, all of these devices are available with 
specifications guaranteed over the full commercial temperature 
range and power supply tolerance as well as the full military 


temperature range and power supply tolerance. All of these de- 


vices undergo 100%; reliability assurance testing in compliance 
with MIL-STD-883. 
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APPENDIX A 


Figure A1 shows the logic diagram of an interface circuit used to 
connect the microprogrammed CAT controller to any Am9080A 
type processor. Sixteen address-lines, eight data lines, a 
memory-read, a memory write and an /O write signal are as- 
sumed to be used in an active LOW polarity. 


An Am25LS2521 8-bit comparator is used to decode the addres- 
ses of the 2K by 8 character memory. This memory can be placed 
anywhere in the memory space In increments of 2K by using 5 
DiP-switches. The comparator is enabled by the presence of 
either the MMR or the MMW signal. The output of this comparator 
is the HOST ACCESS signal. 


The HOST ACCESS signal enables the two Am25LS240 buffers 
which connect the processor address bus to the character mem- 


Note. Figure A2 is af beck of the book. 


ory address bus. It also enables one half of an Am25LS241 butter 
transferring the MMR or MMA active LOW _ to the proper 
data buffer enable (Am25LS240's) and to the pins of the four 
Am9114 memories in case of a memory write operation. The CS 
of two of these memories are driven by Aig while the CS of the 
other two memories are driven by Ajo, thus forming a 2K by 8 
memory space. 


An Am25LS2521 8-bit comparator is enabled by the VOW control 


- line. fn maiches the settings of the DIP switches at the B inputs of 


the corhparator, an OUT n instruction will write the data into the 
Am25LS374 “First Address Register’. 


Figure A2 shows the complete wiring diagram of this interface 
Circuit. 
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Figure A1. CRT Controller. 
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Figure A2. CRT Controller. 
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APPENDIX B 


General 


A software emulation of the CRT controller was written in 
BASIC-E and run on the System 29 support processor. Figure B1 
fs a printout of this program. 


Notations 


For reference purposes, each clock pulse (CP) in the program is 
numbered. The clocks are character-rate clocks. A subscript"10” 
signifies that this vanable belongs to the Am2910 (e.g. R10 = the 
contents of the Am2910 Register Counter) and similarty a sub- 
Script 11 signifies tha Am2911 dependent variables (e.g. Y11 — 
the Y outputs of the two more significant Am291 15). 


Usually the normal function names were used though for the 
active LOW functions the bar was deleted for simplicity. A 0 
signifies always a LOW and 1 signifies HIGH. Other abbrevia- 
tons used in‘the program: 


MA = Microprogram Address (Y output of the Am2910) 
* CA = Character Address 
PC = Program Counter (internal) 
R = Register (intemal) 
F = File (intemal) 
SP = Stack Pointer (intemal) 
TENC = The Am25LS168 decade counter 
L4B = The 4 least significant bits of CA (the Y outputs of 
the less significant Am2911 
CN = Cany-in into the less significant Am2911 
CN4 = Carry-out from the less significant Am2911 
CN4 = Canrry-in to the next significant Am2911 
{10 = The Am2910 instruction 
HB = Horizontal Blanking signal (active HIGH) 
VB = Vertical Blanking signal (active HIGH) 
CRM = Maximum Clock Pulse (at which the program 
stops) 


Description 


The different groups and subroutines of the emulation program 
are as follows: (See Figure B1). 


<1000 series: The microcode. 


1000 series: 


2000 series: 
2500 series: 


3000 series: 


4000 series: 
5000 series: 


5500 series: 


6000 series: 
7000 series: 
9000 series: 


Subroutine 50 is the 
Am25LS168 decade counter clocking routine. 
TENTH is the RCO output of this device. 

This is essentially the Am2910 emulation. 
Note the definition of the two functions 


"FNFAIL and FNPASS at the beginning of the 


program, compare to the Am2910 instruction 
definitions in its data sheet. 

The Am25LS153 multiplexer emulation. 

The less significant Am2911 emulation. Note 
that the only input to this device is ZEROL. 
CN and the Intemal PC (called L4B) are con- 
trolled in the CLOCK Subroutine (4000 series). 
The two more significant Am2911's emulation, 
Sq and S, are treated as a single number 
(ranging trom 0 through 3) and denoted by 
$11. - 

The Clocking routine. 

The main emulation routine. It includes the 
Am25LS2521 comparator routine and checks 
the Clock Pulse against CPM to determine 
end of run. 

Emulation parameter setup (initialization). 
The starting and ending CP numbers, MA, 
TENC, R10 and VECTOR (The “First Address 
Register") can be set. 

Sets up the print-out parameters 

Printout subroutine 

Sets the program mode: RUN, PRINT or QUIT 
(return to CP/M) 


The emulation was exercised to evaluate fifteen different perfor- 
mance aspects of the CRT Controller. The results indicated that 


‘In all cases, the design operated as desired. 


REM 


REV212 

PRINT REV 

9000 REM HEADER 
PRINT 
PRINT 


REM 


REM 
REM 
REM 
REM 
REM 
REM 
9100 


REM 
9120 
REM 
9130 


REM 
9110 


REM 
6000 


6010 


PRINT © 9090-96 96-96 56-96 96.06 96-96-36 90 46-96-90 90 90-96-96 96-96-96 30 90 90 0 90 00 EF 0 OE i EE 0 8 8 2 0 EB OO OE 
PRINT. ‘ 

PRINT : 

PRINT * A MICROPROGRAMMED CRT CONTROLLER EMULATION® 

PRINT 

PRINT 

PRINT © 96-96-9606 96-96 56-06-26 36-96-96 90 30-40 96-20 90 6-00 26-90 06-96 00 20 OE 0 0 OE 00 08 8 2 0 2 OE 2 0 2 8 9 8 0 28 8 0 2 26 
PRINT 

PRINT 

PRINT ® BY MOSHE M. SHAVIT® 

PRINT ° ADVANCED MICRO DEVICES* 

PRINT ° FEBRUARY 27, 1978° 

PRINT 

PRINT 


DIM F10¢6) 
DEF FNFAIL=CCEN=0 AND CC=1 
DEF FNPASS=CCEN=1 OR CC=0 


GOTO 6000 REM PROGRAM PARAMETERS (REMOVED REV 6) 
<--REV 6 


PRINT 
PRINT 
PRINT 
INPUT °R-UN, P-RINT OR Q-UIT °;MODES 
IF LEN(MODES)=0 THEN GOTO 9100 | 
MODE=ASC (MODES )-79 
IF MODE<1 OR MODE > 3 \ 
THEN PRINT MODES; * IS INVALID*:\ 
GOTO 9100 
ON MODE GOTO 9110,9120,9130 


RETURN 


REM RUN 
PRINT 
INPUT °PUT RESULTS ON FILE (0 IF DIRECT PRINTOUT)= “*;WFILES 
PRINT °CP= "sCP3"MA= °sMA;"VECTOR= ";VECTOR;\ 
"CPM= ";CPM;°ROW= "°524-R10 
INPUT "INITIALIZE (Y OR Ns CP,MA=0 IF N)°7SS 
IF Sss="y" \ 
THEN GOSUB S500 \ REM INIT. 
ELSE CP=0 =: MA=0 U 
IF WFILES="0" \ 
THEN GOTO 6010 \ REM DIRECT PRINTOUT 
ELSE FILE WFILES s GOTO S000 REM MAIN 


REM PRINT : 
PRINT 

INPUT "GET RESULTS FROM FILE=";RFILES 
FILE RFILES 


REM PRINT PARAMETERS 
PRINT 


PRINT “OUTPUT FORMATS:®* 
Figure Bt. 


PRINT * A=CP AND CA ONLY* 
PRINT * B=CP,CA,HB,VB,MA® 
PRINT * C=CP,CA,MA,TENC,R10° 
PRINT * DRALL" 

PRINT 


INPUT "FORMAT#=° ;FORMATS 
IF LEN(FORMATS)#0 THEN GOTO 6010 
IF ASC(FORMAT$)<65 OR ASC(FORMATS)>468 \ 
THEN PRINT FORMATS;° IS ILLEGAL" eN\ 
GOTO 6010 
PRINT 
REM 
6020 REM 
IF WFILES NE "0" \ 
THEN CONTROLS="A® s\ 
GOTO 6030 
PRINT “CLOCK CONTROL® 
PRINT ® . ° A=CONTINOUS® 
PRINT * B=STEP® 
ANPUT *CONTROL=® ;CONTROLS 
IF LEN(CONTROL$)=0 THEN GOTO 6020 
IF ASC(CONTROLS$)<65 OR ASC(CONTROLS)>66 NV 
THEN PRINT CONTROLS;° IS ILLEGAL" <\ 
GOTO 6020 
PRINT 
REM 
6030 PRINT "OUTPUT CONTROL® 


PRINT ° A=AT EACH CF®° 

PRINT ° B=AT EVERY N-TH CP* 
PRINT ° C=MANUAL CONTROL ° 
PRINT ° 


D=STARTING AT CPS AT EVERY CP® 
PRINT ° E=STARTING AT CPS AT EVERY N-TH CP® 
INPUT "OUTPUT=" sQ0UTPUTS 
IF LENCOUTPUTS)=0 THEN GOTO 6030 
IF ASC(OUTPUTS)<65 OR ASCCOUTPUTS)>69 N\ 

THEN PRINT OUTPUTS; * Is ILLEGAL" s\ 

GOTO 6030 ‘ 

O.CTL=ASC (OUTPUTS )~64 - 


ON O.CTL GOTO 6090, 6032, S090, 6034,6036 
6032 INPUT "N="5N 


H=0 
GOTO 6090 
6034 INPUT "CPS= *=CPS 
GOTO 6090 
6036 INPUT *CPS= *;CPS 
INPUT "N= "5N 
H=0 
GOTO 6090 
REM ; 
6090 FORMAT = ASC(FORMATS)-64 
ON FORMAT GOSUB 6190,6300,6200,6100_— 
IF WFILES="0* THEN GOTO 5000 REM MAIN 


REM 
6900 PRINT 
. IF END $1 THEN 6910 


FOR I=1 TO 2 STEP oO REM DO UNTIL END OF FILE 

READ ¢1; CP,R10,F1,SP10,PC10,CA,MUX,CC,CCEN,MA,TENC,\ 
CN4,F11,HB,VB 

F10(SP10)=F1 

GOSUB 7000 REM PRINT 

GOSUB 5200 REM ESCAPE (REV 7) -- 

IF S=155 THEN PRINT:PRINT “ABORTED AT °3CP-3 GOTO 6910 

NEXT I 
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REM 
6910 


REM 
6100 


6190 
REM 
6200 


REM 
6300 


REM 
REM 
7000 


REM 
7002 


REM 
7003 
REM 
7004 
REM 
7005 


REM 
7010 
REM 
7100 


REM 
7200 


REM 
7300 


REM 
7400 


REM 
REM 
5000 


CLOSE 1 
QUT 100,12 REM PRINTER PAGE EJECT (REV 7) 
GOTO 9100 


PRINT 

PRINT °CP*°,°R10°,°F10°, °SP10°%, *PC10° 
PRINT °CA®,*MUX",°CC", °CCEN*, *MA® 
PRINT “TENC®,°CN4°,‘°F11",°HB®,°VB" 


-PRINT 


RETURN 
PRINT 
PRINT 
RETURN 


PRINT 


PRINT "CLOCK®, "CHAR.ADDR®, "H.BLANKING", °V.BLANKING®, °NEXT MA" 


RETURN 


REM PRINT SUBROUTINE 
ON O.CTL GOTO 7010,7005,7002,7003, 7004 


INPUT "OUTPUT CY OR N)";SS 
IF S$="Y¥* \ 
THEN GOTO 7010 N 
ELSE . RETURN 


IF CP<CPS THEN RETURN ELSE GOTO 7010 
IF CP<CPS THEN RETURN ELSE GOTO 7005 


M=M+1 
IF M=N THEN M=0 = GOTO 7010 ELSE RETURN 


ON FORMAT GOTO 7100,7200,7300,7400 
e : 35 

PRINT °CP= *;CP,°CA=°;CA 

RETURN LeMtuat 


IF HB=0 THEN HRS="L° ELSE HRS=* H® 
IF VB=0 THEN VBS="L° ELSE VBS=* H* 
PRINT -CP,CA,HBS,VBS,HA 

RETURN 


PRINT 
PRINT CP,CA,R10,TENC,MA 
RETURN 


PRINT 
PRINT CP,R10,F10(SP10) ,SP10,PC10 
PRINT CA,MUX,CC,CCEN,MA 


PRINT TENC,CN4,F11,HB,VB 


RETURN 

REM MAIN ROUTINE 

REM 

GOSUB 4000 REM CLOCK 

REM FETCH MICROCODE 

ON MA+1t GOSUB 30,2,3,4,5,6,7,78,7,10,11,12,13,14,15,16,17,18,19, 20, 21,22 
GOSUB 2500 REM 2911L 

GOSUB 3000 REM 2911H 
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*CLOCK*, "CHAR.ADDR",°2910 REG.",°LINE CNTR.°,°NEXT HAS 
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CA=Y11"1464+L4B 8 REM CHARACTER ADDRESS 
REM COMPARATOR NEXT 
IF Y11=120 AND TENTH=0 \ REM REV 8 


THEN COHP=0 \ 
ELSE COMP=1 


GOSUB 2000 REM — MUX 
GOSUB 1000 REM 29710 
REM REV 6 


IF WFILES="0* THEN GOSUB 7000 \ REM DIRECT PRINTOUT 
ELSE PRINT #1;CP,R10,F10(SP10),SP10,PC10,CA,MUX, \ 
cc, CCEN, HA, TENC,CN4, Fil, HB,VB 


IF CONTROLS="B°* THEN INPUT Ss REM SINGLE STEF 
REM CHECK END OF RUN 
GOSUB 5200 REM ESCAPE (REV 7) 
IF S=155 THEN PRINT:PRINT “ABORTED AT *;CP =: GOTO 5100 
IF CP<CPM THEN GOTO S000 REM REPEAT MAIN 
REM 
3100 IF WFILES NE "O°" THEN CLOSE (1) 
OUT 100,12 REM PRINTER PAGE EJECT (REV 7) 
GOTO 9100 
REM 
REM 5200 SUB REV 7 
5200 REM ESCAPE SUBROUTINE 
S=INP(97) 
S=INT(S/2) 
$=S/2-INT(S/2) 
IF S NE O THEN S = INP(96) 
: RETURN 
REM 
5509 REM INITIALIZATION 
PRINT 
SP10=1 


PRINT *MA= "5HA 
5505 INPUT "NEW MA (Y OR N)°;SS 
IF SS="N* THEN GOTO 5510 


INPUT tay (O0<=MAC22)"5MA 
MA=INT 


IF Maco "OR MA>21 \ 
THEN PRINT MA;* IS ILLEGAL" : 
. GOTO 5505 
“ MA=O0 THEN TENC=0 = HB=1 = TENTH=1 


eo 
7 


.REM 
9510 NT 
PRINT *°VECTOR= ";VECTOR™ 
S515 INPUT "NEW VECTOR (CY OR N)"3SS 
IF SS="N° THEN GOTO 5520 
INPUT *VECTOR=(0<=VECTOR<120) °sVECTOR 
VECTOR=INT(VECTOR) 
IF VECTOR<O OR VECTOR>119 \ 
THEN PRINT VECTOR;" IS ILLEGAL® :\ 


GOTO 3515 
REM 


5520 PRINT 
PRINT "CP= °;CP 
INPUT "NEW CP (Y OR N) *3SS 
IF €$="N* THEN GOTO 5530 
5525 INPUT *CP(>=0)= "CP 
CP=INT(CP) 
IF CP<O THEN FRINT CP;* IS ILLEGAL" =: GOTO 5525 
REM 
S556 PRINT 
PRINT “CPH= "sCPH 
5535 INF JT "NEW CPM (Y OR ND*sSS 
IF S$="N* TREN GOTO 5540 
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INPUT *CPM=(CP+1<CPM)";CPM 
CPM=INTCCPM) 
IF CPM<CP+1 THEN PRINT CPM;* IS ILLEGAL®;°CP= *sCP :GOTO 5535 
REM . . 
5540 PRINT 
PRINT °TENC= *sTENC 
IF MA=0 THEN GOTO 5550 
5545 INPUT *NEW TENC (Y OR N)*°5S$ ' 
IF S$="N* THEN GOTO 5550 . 
INPUT *TENC=(0<=TENC<10) * sTENC: y 
TENC=INT(TENC) : 
IF TENC<O OR TENC>9 \ 
THEN PRINT TENC3* IS ILLEGAL® =\ 
GOTO 5545 
IF TENC=9 THEN TENTH=0 ELSE TENTH=1 
REM 
5550 PRINT 
PRINT "R10= "R10 
5555 INPUT "NEW R10 CY OR N)°3S$ 
IF S$=°N* THEN GOTO 5560 
INPUT "R10 (0<=R10<25)=" sR10 
R1O=INT(R10) | 
IF R10<O OR R10>24 THEN PRINT R10;° IS ILLEGAL* =: GOTO 5555 
REM 
55460 REM 
RETURN 
REM 
REM 
REM 
30 110=6 
CCEN=0 
MUX=3 . 
S11=3 
FE=0 
ZEROH#=1 
ZEROL=0 
CN=0 
HB=1. REM REV 2 
VB=0 
PL=0 
RETURN 
REM 
2 - 110=12 
$11=0- 
FE=1 i 
ZEROH#1 I 
ZEROL#=0 \} 
CN=0 i 
HB=1 REM. REV 2 | 
VB=0 | 
PL=23 
RETURN | 
REM | 
3 110=14 i 
$11=2 
FE=1 | 
ZEROH=1 ' 
ZEROL=6 | 
CN=L 
HB=1 REM REV 2 
VB=0 
RETURN 
REM 


110=3 
sg Figure B1 (Cont.) 


a 


REM 


REM 


REM 


REM 


CCEN=0 
MUX=1 
$1120 
FE=1 
ZEROH=1 
ZEROL=1 
CNF1 
HB=0 
VB=0 
PL=3 
RETURN 


110=3 
CCEN=0 
MUX=1 
§11=0 
FE=1 
ZEROH=1 
ZEROL=1 
CN=1 
HB=0 
VB=0 
PL=4 
RETURN 


110=3 
CCEN=0 
MUX=1 
$11=0 
FE=1 
ZEROH=1 
ZEROL=1 
CN=1 
HB=0 
VB=0 
PL=5 
RETURN 


11083 
CCEN=0 
MUX=1 
S11=0 
FE=1 
ZEROH=1 
ZEROL=1 
CN=1 
HB=0 
VB=0 
PL=6 
RETURN 


11053 
CCEN=0 
MUX=1 
$1120 
FEH1 
ZEROH=1 
ZEROL#1 
CN=1 
HB=0 
VB=0 
PL=7 
RETURN 


Figure B1 (Cont) , 


REM 


REM 


REM 
11 


REM 
12 


REM 
13 


REM 
14 


I10=1 
CCEN=0 
MUX=0 
§11=0 
FE=1 
ZEROH=1 
ZEROL#=1 
CN=L 
GOSUB 50 REM TENC 
VB=0 
PL=12 
RETURN 


I10=1 
CCEN=0 
MUX=2 
S11=0  __ 
FE=1-° 15 = 
ZEROH=1 || 
ZeEROte1 
CN=1 ys 
GOSUB 50 
VB=0 - - 
PL=21 
RETURN 


110=3 
CCEN=0 
MUX=1 
$11=0 
FE=1 
ZEROH=1 
ZEROL=1 


*CN=1L 


GOSUB 50 
VB=0 
PL=10 
RETURN 


110=3 rh 
CCEN=1 

S11=0 

FE=1 ecry 
ZEROH=1 
GOSUB 50 
VB=O - 

PL=2 0... 
RETURN 


I10=9 
§11=0 


FE=0 REM REV 5 


ZEROGH=1 
ZEROL=1 
CN=1 
GOSUB 50 
VB=0 
PL=20 
RETURN 


110=6 
CCEN=0 
MUX=3 
S11=3 


REM 
15 


REM 


REM 
16 


REM 


RE 
18 


REM. 


19 


REM 
20 


FE=0 REM REV 10 
ZEROH=1 ; 
ZEROL=0 

GOSUB 50 | 

VB=1 

RETURN 


110212 

S$11=0 REM REV 10 
FE=1 REM REV 10 
ZEROH#1 

ZEROH=1 REM 
GO8UB SO 

VB=1 

PL=119 

RETURN - 


110=4 
CCEN=0 
MUX=3 
$11=0 
FE=1 
ZEROH=1 
ZEROL=1 
CN=1 
GOSUB 50 
VB=1 
RETURN 


110=3 
CCEN=0 
MUX=1 
§11=0 
FE=1 
ZEROH=1 
ZEROL=1 
CN=1 
GOSUB 50 
VB=1 
PL=16 
RETURN 


110-8 

$11=0 

FE=1 ‘ 
ZEROHF1 - 
ZEROL=1 

CN=1 

GOSUB 50 

VB=1 

RETURN 


110=12 
$11=0 
FE=1 
ZEROH=1 
ZEROLSL 
CN=1 
GOSUB 50 
VB=1 
PL=#23 
RETURN 


110=10 


REMOVED REV 10 
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CCEN=1 
$11=0 
FE=1 
ZEROH@=1 
ZEROL@=1 
CN=1 
GOSUB 50 
vBAi 
RETURN 

REM 

21 - 110210 
CCEN=1 
$11=0 
FE=1 
ZEROH=1 
ZEROL=1 
CN=1 
GOSUB 50 
VB=0 
RETURN 

REM 

22 110-10 
CCEN=1 
FE=0 REM REV 9 
ZEROH=0 
ZEROL@=1 REM REV 9 
CN=1 
GOSUB 50 
VB=0 
RETURN 

REK 


50 REM TEN-LINE-COUNTER CLOCKING SUBROUTINE 
IF HB=1 THEN RETURN 
HB=1 
TENC=TENC+1 
IF TENC=9 THEN TENTH=0 ELSE TENTH=1 
IF TENC=10 THEN TENC=0 
RETURN 
REM PUSH AND POP SUBROUTINES REMOVED REV 3 
1000 REM 29710 INSTRUCTIONS SUBROUTINE 
ON I10+1 GOTO 1100,1110,1120,1130,1140,1150,1160,1170,11680, \ 
= 1190,1200,1210,1220,1230,1240,1250 
4 


1100 REM JZ = 
MA=0 REM 2910 Y 
SP1i0=0 REM 2910 STACK POINTER (=0 REV 3) 
RETURN : 
REM 
1110 REM CJS 
IF FNFAIL N 
THEN MA=PC10 \ 
ELSE MASPL =\ 
PUSH=1 REM REV 3 
RETURN 
REM 
1120 REM JMAP 
PRINT "JMAP NOT PROGRAMMED" 
RETURN 
REM 
1130 REM CUP 
IF FNFAIL N 
THEN MA=PC10 \ 
ELSE MA=PL 
RETURN Figure Bt (Cont) 


REM 
1140 


REM 
1150 


REM 
1160 


REM 
1170 


REM 
1180 


REM 
1190 


REM 
1200 


REM 
1210 


REM 
1220 


REM 
1230 


REM 
1240 


REM PUSH 
IF FNPASS THEN R10=PL REM LOAD COUNTER 
MA=PC10 
PUSH=1 REM REY 3 
RETURN 
REM JSRP 
PRINT "JSRP NOT PROGRAMMED® 
RETURN 
REM Cuv 
IF FNFAIL N 
THEN MA=PC10 \ 
ELSE MA=VECTOR 
RETURN. 
REM JRP 
IF FNFAIL N 
THEN MA=R10 N\ 
ELSE MA2PL 
RETURN 
REM RFCT 
IF R10=0 \ 
THEN MA=PC1O =:N\ 
POP=1 N 
ELSE MA=F10(SP10) =\ 
R10=R10-1 
RETURN 
REM RPCT 
IF R10=0 N\ 
THEN MA=PC10 N\ 
ELSE MA=PL 2N\ 
R10=R10-1 
RETURN 
REM CRTN 
IF FNFAIL N 
THEN MA=PC10 N\ 
ELSE MA2F10(SP10) s\ 
POP=1 REM REV 3 
RETURN 
REM CJPP 
PRINT "CJPP NOT PROGRAMMED" 
RETURN 
REM LDOCT 
R10=PL 
MA=PC10 
RETURN 
REM LOOP 
IF FNFAIL N 
THEN MA=F10(SP10) N 
ELSE MA=PC10 =:\ 
POP=1 REM REV 3 
RETURN 
REM CONT 
MA=PC1O 
RETURN Figure 81. (Cont.) 
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REM 
1250 


REM 
REM 
2000 


REM 
2100 


REM 
2200 


REM 
2300 


REM 
2400 


REM 
REM 
2500 


REM 
REA 
REM 
REM 
3000 


REM 
3100 


. REM 


3200 


REM 
3300 


REM 
3400 


REM 
REM 
4000 
REM 


REM 


REM TUB 

PRINT "TWB NOT PROGRAMMED? 
RETURN 

REM MUX SUBROUTINE 


ON MUX+1 GOTO 2100, 2200,2300,2400 


IF TENTH20 \ 


THEN cc=0 \ 
ELSE cc=1 


RETURN 
IF CN4=0 \ 
THEN CC=0 \ 
LSE cC=1 
RETURN 
IF COMP=0 \ 


THEN C=O \ 
ELSE  CC=1 
RETURN 


cC=1 
RETURN 


REM LEAST SIGNIFICANT 2911 (2911L) SUBROUTINE 
IF ZEROL=0 THEN L4B=0 
RETURN 


REM MORE SIGNIFICANT 2911S (2911H) SUBROUTINE 
ON Si1+1 GOSUB 3100,3200,3300,3400 

IF ZEROH=0 THEN Y11=0 

RETURN 


Y1i=PC11 
RETURN - 


Y11=R11 
RETURN 


Yi1=F11 
RETURN 


IF 110=4 \ 


THEN Y11=VECTOR \ 


- ELSE Y1L1=PL 
RETURN 


REM CLOCK SUBROUTINE 
PCiIO=MA+1 REMOVED REV 4 
IF CN=1 THEN L4BeL4B+1 
IF L4B>15 THEN L4B=0 3 CN4=1 ELSE CN4=0 
IF CN4=1 \ 

THEN PCil#Yiie4l \ 

ELSE PCi1=Y1i1 
IF FE=0 THEN F1i=PC1i1 
<--REV 3 

Figure 81 (Cont.) 


IF PUSH=1 \ 
THEN 


IF SP10>4 \ 
THEN 


IF POP=1 \ 
THEN 


IF SP10<0O \ 
; THEN 


REV 3 --> 
PC10sMA+1 
CP=CP+1 
RETURN 


SP10=SP1041 :N\ 
F10¢(SP10)=PC10 =\ 
PUSH=0 


PRINT °2910 STACK FULL ° =\ 
§P10=3 


SP10=SP10-1 3:\ 
POP=#0 


PRINT "POP EMPTY FILE? °;CP =\ 
SP10<0 


REM REV 4 


Figure B1 (Cont) 
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APPENDIX C 


A simple circu was designed to accommodate tive different 
display formats and also to comply with the European 50Hz TV 
Standard. Figure C1 is the circuit diagram of this additional circuit. 


The following parameters change when the format is changed: 

1) The number of characters/line. 

2) The number of lines/frame. 

3) The number of characters to display (i.e., the address of the 
last character). 

4) The hne frequency and therefore the dot frequency. 


The number of charactersiline is counted by the least significant 
Am2911 sequencer wa the microcode. Therefore, the microcode 
can be changed to change the number of characters/line. The 
number o! lines/frame is counted by a constant, loaded into the 


Am2910 internal counter by the microcode. The microcode can 
be changed to vary the number of lines/frame. 


The scan Is reinitialized to zero when the last address +1 is 
attained. Ug (Am25LS2521) detects this address by comparing 
bits A, through Aj of the character address bus to a constant 
supplied to its B inputs. A table listing these constants is shown n 
Figure C1. By setting the DIP switches according to that table, the 
character scan will reinitialize correctly. The same constant is 
routed through one half of an AmM25LS240 (U24) to the intemal 
data bus. At microprogram address zero, a JUMP MAP instruc- 
tion enables these outputs thereby putting a starting address on 
the bus according to the table in Figure C1. 


The microprogram is shown on Figure C2. 


FREQ. (MHz) 


Figure C1. 


eons? 
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A>TYPE CRT.DEF 

3CKT DEFINITION FILE 
;HY MCSHE M. SHAVIT 
7REV 2 3/6/78 


’ 
TITLE CRT CONTRCLLER ~-DEFINITIONS 


WCRD 24 
t] 
FE: Ler 1VBa1,23X 
ZEROR: IEF 1X,1VE#1,22X > 
$11: DEF EX, 2VZ:Q#,ZOX 
11z: DEF 4X,4VH#,1EX 
CW: DEF * OX,1VBA1,14X 
ZERCL: DE: 10X,1VBa#1,13X 
VB: DEF 11X,1VB4d,12X 
FE: DEF 12X,1VB#d9,11X 
CCEN: DEF 13X,1VEH#,10% 
MUX: TEF 14X,Be#06,8X 
MUX1: TEF 14X,B#12,8X 
MUX.Z: DEF 14X,B#81,8X 
MUXSs DEF 14X ,3411,8X 
PL: DEF 1EX,EVE: 
: ECU Red 
: FQ Bal 
, 
CCUAT: DEF BEL, BAL, BAG, 5X, BA1, B41, BYe ,BHS,1X,zX BF 
COUATE: DEF 3#1,BA1, BAGG, 5X, E41, B41, BAe, BA1,1X, 2X , 8X 
COUATV: LEF Bal, BA1, BAO, 5X, 541,841,541, BH1,1%,2X ,8X 
, > ° 
EAL 
A> 


Figure C2. AMDASM Definition and Assembly Files for the CAT Controller. 


AMDOS/29 AMDASM MICRO ASSEMELER. V1.1 
CRT COATROLLER 


eee 


@ee1 
e20e2 


8283 
O804 
eee5 
@026 
6887 
eeee 
2209 
OLGA 
GOCE 
@8eC 
eeeLr 
ACK 
QBE 


ecer 
6812 
211 
012 
0813 
Q014 


@815 
BG16 


8817 
0018 


@819 
OG1A 
0013 
@21C 
@81D 
OG1E 
@01F 
8928 
O21 
@822 
ACK 
8023 


0624 
8825 
8226 
2827 


3CFT CONTRCLLER MICROFRGGKAY 


i 

EY MOSHE M. SBAVIT 
tREV 2 §/3/7E 
t 


110 Ha2 ;JUMP MAP 


we we weve 


248¢: 118 


HAE 


/ CN L & EB E 


118 


Hac 


/VB & PL D#23 


M2488: 118 
118 
11¢ 
118 
118 
118 
118 
118 
110 
110 

T2488: 118 


110 


H#E 
Ba3 
Has 


HAS 


HAS 
BHS 
HAL 
Bi 
HAS 
Bas 
Hag 


Bee 


/ BBE & VBR 


118 
T1@ 
11¢ 
* 118 
118 
110 


i 
GOBACK: 118 
LASTA: 118 


2464; 118 


BHC 
Bea 
Has 
Hv¥8 
Bac 
HAA 


Ea 
HAA 


24 ROWS 


a6 


/ CN L6& EBE 


118 


Hac 


/VB & PL D#23 


M2464: 118 
118 
118 
118 
118 
118 
11g 
110 
118 
712464; 118 


110 


HAE 
Hed 
Has 
HAS 
B#S 
Ea. 
H#1 
Has 
BAS 
Hag 


Ha6 


/ BB H & YBR 


118 


HAC 


& 
& 
6 
& 
& 
& 
& 
& 
& 
& 
& 
& 
& 
6 
6 
& 
& 
& 
& 
& 
& 
& 
& 


24 ROWS 88 CHARACTERS 68 F/S 


CCEN L & MUXS & S113 & FEL & ZEROH & ZEROL L & 
VB 


$11 8 & FE & ZEROH & ZEROL L & CNLS& BE BS 
$11 2 & FE & ZEROH & ZEROL L 6 CN & HB E S& VB 


CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUXL & COUNT & PL $§ 

CCEN L & MUX® & COUNTER & PL T2488 
CCEN L & MUX2 & COUNTE & PL LASTA 
CCEN L & MUX1 & COUNTE & PL $ 
CCEN BS 


S11 @ & FE & ZEROH & HB H & VB & PL M2486 
S11 ® & FEL & ZEROH & ZEROL & CN H & HB E & VB & PL GOP 


CCEN L & MUX3 & Si1 3 & PEL & ZEROH & ZEROL L & 


511 @ & FE & ZEROH & HB B & WB OS & PL Dre 
CCEN L & MUXS & COUNTY 


CCEN L & MUX1 & COUNTY & PL $ 
COUNTY 


COUNTY & PL D235 
CCEN H & COUNTY 


CCEN H & COUNTE 
CCEN H & FE L & ZERCH L & ZEROL & CN H & EB H & VB 


64 CHARACTERS 68 F/S 


CCEN L & MUXS & S11 36 FE L & ZEROH & ZEROL L & 
VB 

511 @ & FE & ZEROH & ZEROL L & CNL & BB E & 

Sil 2 & PE & ZEROH & ‘ZEROL L & CN & HB H & VB 


CCEN L & MUX1 & COUNT.& PL $ 

CCEN L & MUX1 & COUNT & PL 

CCEN L & MUX1 & COUNT & PL 

CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUX@ & COUNTH & PL T2464 
CCEN L & MUX2 & COUNTH & PL LASTA 
CCEN L & HUX1 & COUNTH & PL $ 
CCEN B& 


S11 0 6& FE & ZEROH & HB BH & VB & PL M24E4 
Sil @ & PE L & ZEROH & ZEROL & CN H & HB B & VB & PL GOB 


CCSN L & MUX3 & S11 3 & FEL & ZEROH & ZEROL L & 
S11 8 6&6 FE & ZEROH & HB H & VB A & PL DW122 


CCEN L & MUXS & COUNTY 
CCEN L & MUI1 & COUNTY & PL §$ 


118 HeS & COUNTY 


Figure C2 (Cont) 


AMDOS/29 AMDASM MICRO ASSEMBLER, V1.1 
CRT CONTROLLER 


ee@2e 
@829 


O02A 
Q823B 


@82C 
82D 
@O2E 
@22F 
8830 
8031 
e232 
0833 
ACK . 
0834 


8835 . 


O936 
8837 
8228 
6839 
GO3a 


QO3E 
883C 


@83D 
O83E 
OQ3F 
, 8842 
6241 
0042 
0243 
0044 
ACK 
0045 


0046 
e247 
8048 

' 9049 
OC4k 
0248 
@04C 

- Q84D 
Q04E 
004F 


710 HaC & COUNTY & PL D235 
110 HWA & CCEN H & COUNTY 


Ci we we ws we we we 


2432: -1186 BNE 
/ CNL S& HB &E 
118 Bac 

/YB & PL D#22 
M2432: 118 ERE 
110 HRS 


118 Hes. 


118 Hai 
110 H#1 
11@ H¥Z 
Tle B4&3 
T2432: 112 Hao 


11@ H#é 
7 HBH & VBE 
118 HAC 
Ii@ a4 
Tid BRS 
110 Has 
718 Bac 
110 H¥A 


CA we we we we we 


1632: 118 H#6 
/ CNL & BB B 
118 Hc 

‘VBS PL D#15 
M1632: 1186 H#E 
718 B#3s 

118 Bas 

-110 H#1 

1190 H#1 

116 HWS 

110 BRS 

T1ES2: 110 Hw9 


110 H#6 
/ BB H & VBS 
110 HAC 
110 H#4 
110 Hw#3 
116 H#8 
I1@ BAC 
110 H#4 
110 Hw 
Ii@ Hwe 


I10 HAC. 


110 HWA 


& 
& 
& 


6 
& 
& 
& 
é 
& 
& 
& 


& 


& 
& 
& 
& 
& 
& 


& 


& Vv 


& 


& 
& 


-& 


& 
& 
& 
& 
& 


& 


& 
& 
& 
& 
& 
& 
& 
& 
& 
& 


24 ROWS 32 CHARACTERS 60 F/S 


cane L & MUX3 & S11 3 & FE L & ZEROH & ZEROL L & 
611 @ & FE & ZEROH & ZEROL L & CNL & BBE S 


$11 2 & FE & ZERCB & ZEROL L & CN & HB E & VB 

CCEN L & MUX1 & COUNT & PL $ 

CCEN L & MUXi & COUNT & PL $ 

CCEN L & MUX@ & COUNTE & PL T2432 

CCEN L & MUX2 & COUNTH & PL LASTA 

CCEN L & MUX1°& COUNTH & PL $ : 

CCEN AR & Si1 @ & FE & ZEROK & HB E & VB & PL M2432 =~. 
$11 @ & FE L & ZEROK & ZEROL & CN BH & EB E & VB & PL GOB 


CCEN L & MUX3 & S1i 3 & FZ L & ZEROH & ZEROL L& 


$11 @ & FE & ZEROH & HB H & VB EH & PL D#?4 
CCEN L & MUXS & CCUNTY 

CCEN L & MUX1 & COUNTY & PL $ 

COUNTY 

COUNTY & PL D235 

CCEN BE & COUNTY 


46 ROWS 32 CHARACTERS 60 F/S. 


CCEN L & MUXS & $1136 FEL & ZEROH & ZEROL L & 
B . . . 

$11 @ & FE & ZEROH & ZFROL-L& CNL & HB HS& 

S11 2 & FE & ZEROH & ZEROL L & CN & HB SE & YB 


CCEN L & MUX1 &.COUNT & PL $ 
CCEN L & MUXi & COUNT & PL $ 
CCEN L & MUX@ & COUNTE & PL T1632 
-CCEN L & MUX2 & COUNTH & PL LASTA 
CCEN L & MUX1 & COUNTH & PL $ 

ES 


CCEN L & MUX3 & Sii 3 & FE L & ZEROH & ZEROL L 6. 


$11 @ & FE & ZEROH & HB’ & WB H & PL D250 
CCEN L & MUX3 6 COUNTY 

CCEN L & MUX1 & COUNTY & PL $ 

COUKTY 

COUNTY & PL D#4B 

CCEN.L & MUX3 & COUNTY 

CCEN L & MUX1 & COUATYV & PL $ 

COUNTY 

COUNTY & PL DM#15 

CCEN H & COUNTY 


16 ROWS 16 CHARACTERS 60 F/S 


Figure C2 (Cont.) 
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AMDOS/29 AMDASM MICRO ASSEMBLER, V1.1 
CRT CONTROLLER 


0508 
@051 


@852 
8853 
0054 
8es5 
@85E 
885% 
eece 
ACK 
BREG 


@@5A 
OB5B 
@65¢ 
@@5D 
BB5E 
@e5F 


Bore 
eere 


@8Fa 
O2F3 


@8F9 
88FS 


OOFB 
OOFB 


OerD 
O@FD 


8198 


9128 
0101 


81¢2 
8123 
0104 
8185 
@12E 
0187 
0128 
8129 
G1G0A 
0108 
@10G 
ACK 


$1616:- 116 B¥6 & CCEN L & MUX3 & S11 3 6& FE L & ZEROH & ZEROL L & 
/ CNL & BBE S& VB 
1i@ H¥C & Sil @ & PE & ZEROH & ZEROL L & CNL & HB E & 
/¥B & PL D#15 
Mi€16: 11@ B¥E & Sil 2 & FE & ZEROH & ZEROL L & CN & HB A & VB 
110 BaS & CCEN L & MUX1 & COUNT & PL $ 
118 H#1 & CCEN L & MUXO & COUNTH & PL T1616 
116 H#i & CCEN L & MUX2 & COUNTH & PL LASTA 
116 Hw3 & CCEN L & MUX1 & COUNTH & PL $ ; 
118 H#? & CCEN H & S11 @ & PE & ZEROH & HB A & VB & PL M1616 
T1616: 110 HA9 & $11 @ & FE L & ZEROH & ZEROL & CN B & BB E & VB & PL GOP 
118 HW6 & CCEN L & MUX2 & S11 3 & FE L & ZEROH & ZEROL L & 
/ HBB SG VB E 
118 H¥C & S11 @ & FE & ZEROH & BB H & WB OH & PL D#203 
110 H#4 & CCEN L & MUX3 & COUNTY 
110 HW3.& CCEN L & MUX1 & COUNTY & PL $ 
11@ B#8 & COUNTY 
118 HWC & COUNTY & PL’ D#15 


110 Hwa & CCEN H & COUNTY 


ORG HAGFO 324788 
110 HWS & CCEN H & PL S2488 


ORG HAGPS 5 24%64 
T16 H¥3 & CCEN BH & PL S2464 


ORG Haerg 324932 
118 Hw3 & CCEN H & PL S2432 


ORG HWOFB 8 3:16*32 
110 EWS & CCEN H & PL $1632 


ORG HWOFD 416%16 
110 HWS & CCEN H & PL S1616 


58 F/S ROUTINES 


ORG Hw100 
24 ROWS @6 CHARACTERS 50 ¥/S 


24Q0EB: 110 HWE & CCEN L & MUX3 & $11 3 & FE L & ZEROH & ZEROL L & 
/ CN L & BBE & VB 
116 H¥C & S11 6 & FE & ZEROH & ZEROL L & CN L & HB SE & 
/VB & PL D#23 ; 
M2482E: 11¢ HHE & S11 2-6 FE & ZEROH & ZEROL L & CN & HB E & VB 
116 HWS & CCEN L & MUX1 & COUNT & PL $ 
11@ Bw3 & CCEN L & MUX1 & COUNT & PL $ 
110 H#2 & CCEN L & MUX1 & COUNT & PL $ 
118 B#3 & CCEN L & MUX1 & COUNT & PL z 
116 Hw3 & CCEN L & MUX1 & COUNT & PL 
118 H#1 & CCEN L & MUXO'G& COUNTH & PL T248GE 
118 H#1 & CCEN L & HUX2 & COUNTH & PL LASTA 
110 Hw3 & CCEN L & MUX1 & COUNTH & PL $ 
118 Had & CCEN H & Sil @ & FE & ZEROH & HB H &-¥B & PL M2480E 


72460B: 110 B49 & S11 6 & FE L & ZERQH & ZEROL & CK BH & HB H & VB & PL GOB 


Figure C2 (Cont) 


AMDOS/29 AMDASM MICRO ASSEMBLER, ¥1.1 
CRT CONTROLLER 


@18D 110 H#6 & CCEN L.& MUX2 & S11 3 & PE L & ZEROH & ZEROL L & 
/ BBE & VB SE 
2195 116 HwC & S11 9 & PE & ZERO & HB H & WB H& PL Da200 = S ITERATES 
201 TIMES 
010P 11@ Hw4 & CCEN L & MUX3 & COUNTY 
0110 119 H#3 & CCEN L & MUX1 & COUNTY & PL $ 
@111 118 H#8 & COUNTY 
8 e 
@112 118 HwC & COUNTY & PL Dse239 
0113 110 He4 & CCEN L & MUX3 & COUNTY 
2114 I1@ H#3 & CCEN L & MUX1 & COUNTY & PL § 
@115 110 E#¥8 & COUNTY 
i} 
0116 110 E#C & COUNTY & PL D423 
0117 118 Hwa & CCEN HB & COUNTY 
8 
9 ‘ 
3 24 ROWS 64 CHARACTERS 50 F/S 
' 
, 
0118 S2464E: 1160 H¥6 & CCEN L & MUX3 & S11 3 & FE L & ZEROH & ZEROL L & 
7 CNL & EB & VB 
@119 110 BWC & Sil 6 & FE & ZEROH & ZBROLL & CNL & EBES 
/¥B & PL D#23 : 
@11A M2464E: 110 HWE & S11 2 & FE & ZEROH & ZEROL L & CN & EB EB & VB 
@11B 119 H#3 & CCEN L & MUX1 & COUNT & PL $ 
@11C I1@ Hw3 & CCEN L & MUX1 & COUNT & PL $ 
@11D 110 H#3 & CCEN L & MUX1 & COUNT & PL $ 
@11E 110 Hw3 & CCEN L & MUX1 & COUNT & PL $ 
@11F 118 Hwi & CCEN L & MUX@ & COUNTH & PL T2464E 
120 110 H#i & CCEN L & MUX2 & COUNTH & PL LASTA 
121 116 H#3 & CCEN L & MUX1 & COUNTE & PL $ 
@122 116 H¥3 & CCEN BH & Sii @ & FE & ZEROH & BB B- & VB & PL M24E4E 
0122 T2464E: 110 Hw9 & S11 @ & FE L & ZEROH & ZEROL & CN H & HB H & VB &.PL GOB 
ACK ; . 
0124 110 Hwé & CCEN L & MUI3 & S11 3 & FE L & ZEROH & ZEROLL & 
/ HB E & VBE 
0125 110 H¥C & Si1 6 & FE & ZEROH & HB H & VB E & PL D#200 
0126 110 Hw4 & CCEN L & MUX3 & COUNTY 
@127 110 H#3 & CCEN L & MUX1 & COUNTY & PL $ 
0126 116 Hw8 & COUNTY 
’ 
@129. 110 H¥C & COUNTY & PL D#167 3369 
0124 110 HA4 & CCEN L & MUX3 & COUNTY 
-212B 110 Hw3 & CCEN L & MUX1.& COUNTY & PL $ 
@12C 110 H#8 & COUKTY 
, 
@12D 116 HAC & COUNTY & PL D#23 
@12E * 110 H¥A & CCEN BH & COUNTY 
, 
i 
’ . 
; 24 ROWS 32 CHARACTERS 50 F/S 
‘ 
’ 4 
012F S2432E: 110 H¥G & CCEN L & MUX3 & S11 3 & FE L & ZEROH & ZEROL L & 
/ CN-L & BB H & VB 
130 119 H#C:& S11 8 & FE & 2EROH & ZBROLL& CNL & BBEE 
: /VB & PL Da23 
@131 M2432E: 110 HWE & S11 2 & FE & ZEROE & ZEROL L & CN & HB H & YB 
8122 110 H#¥3 & CCEN L & MUX1 & COUNT & PL $ 
8133 110 H¥3 & CCEN L & MUX1 & COUNT & PL $ 
134 110 Hwi & CCEN L & MUX6.& COUNTE & PL T2432E 
0135 110 He#l' & CCEK L & MUX2 & COUNTH & PL LASTA 
0136 116 H#3 & CCEN L & MUX1 & COUNTE & PL $ 
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AMDOS/29 AMDASM MICRO ASSEMBLER, ¥1.1 
CRT CONTROLLER 


8137 .116 Bw3 & CCEN H & $11 @ & FE & ZEROH & HB EB & VB '& PL M2432E 


8138 T2452: 110 H¥9 & $11 89 & FE L & ZEROB & ZEROL & CN BH & HB. & VB & PL COB 
ACK 


9139 118 B¥E & CCEN L & MUX3S & S11 3 & FE L & ZEROH & ZEROL L & 
; / HB EH & VBE . F 

135A 116 BwC & Si1 @ & FE & ZEROH & HB H & WB HA & PL De#224 

8138 118 He4 & CCEN L-& MOXS & COUNTY 

@13C 118 H#3 & CCEN L & MUX1 & COUNTY & PL $ 

@13D 118 Hes & COUNTY 

G13E 118 HeC & COUNTY & PL D#23 

Q13F T18 B¥A & CCEN BH & COUNTY 


16 RQWS 32 CHARACTERS £8 F/S 


@148 S1E32E: 118 Hee 
/ CNL & BBE 
8141 ‘118 Bec 

/¥B & PL D415 


@142 M1IES2E: 116 H#E 


CCEN L & MUX3S & S11 3 & PE L & ZEROH & ZEROL L & 
vB tay 
S11 © & FE & ZEROH & 2BROL L & CN L & BB ES 


S11 2 & FE & ZEROH & ZEBROL L & CN & HB H & VB 


O54 118 Hwa 


& 
& 
& 
& 
8143 118 B#z & CCEN L & MUX1 & COUNT & PL $ 
0144 116 H¥3 & CCEN L & MUX1 & CCUNT & PL $ 
0145 116 H#1 & CCEN L & MUX® & COUNTH & PL T1632E 
+ 0146 118 H#1 & CCEN L & MUX2 & COUNTH & PL LASTA 
8147 118 HW3 & CCEX L & MUX1 & COUNTH & PL $ 
0146 110 Hw3 & CCEN BH & S11 @ & FE & ZEROR & HB H & VB S PL M1E32E 
este T1632E: 110 H#o & $11 6 & FE L & ZEROH & ZERCL & CNH & HB H & VB & PL GOB. 
A 
O14A ; Bue & CCEN L & MUX3 & S113 & FEL & ZEROH & ZEROL L & 
HEE S& E 
614B _ 118 HAC & S11 @ & PE & ZEROH & HB H & VB H & PL D250 
@14C 118 H#4 & CCEN L & MUXS & COUNTY © 
614D 116 H#3 & CCEN L’& MUX1 & COUNTY & PL“$ 
G14E 118 H#S & COUNTY 
@14F 110 HaC & COUNTY & PL D#223 3475 — 
6158 116 H¥4 & CCEN L & MUIS & COUNTY 
2151 118 HAS & CCEN L & MUX1 & COUNTY & PL $ 
8152 118 H#8 & COUNTY 
8153 118 H#C & COUNTY & PL DA15 
& 


CCEN H & COUNTY 


16 ROWS 16 CHARACTERS 590 F/S 


Gwe we we we we 


2155 Si€i6E: 118 HW6 


CCEN L & MUX3 & S113 6 FEL S& ZEROE & 2EROL L& 
/ CNL & HB HEH S& VB 


‘CCEN L & MUX1 & COUNTY & PL $ 


0162 - 118 Hee & COUNTY 


& 
& 
0156 11@ BWC & S11 @ & FE'& ZEROH & ZEROL L & CN L & HB # & 
/YB& PL D#15 
6157 Mi€1EE: 116 H#¥E & S11 2 & FE & ZEROH & ZEROL L & CN & HB B & VB 
2156 118 He3 & CCEN L & MUXI1 & COUNT & PL $ 
8159 118 H#1 & CCEN L & MUX@ & COUNTH & PL T1616E 
G154 118 Hat & CCEN L & MUX2 & COUNTH & PL LASTA > 
@15B 110 H¥S & CCEN L & MUX1 & COUNTH & PL$ - 
@15C 118 H#S & CCEN H & Si1 @ & FE & ZEROH & HB H & VB & PL M1616E 
@15D Ti61EE: 118 Hwe & S11 6 & FE L & ZEROH & Z2EROL *. CN H & BB A & VB & & PL con 
ACK 5 
@155 . 116 BW6 & CCEN L & MUXS & Si1 3 & FE.L & ZEROH & ZEROL L & 
/ BB H & YB SE : ; 
B15F 118 H#C & S11 @ & YB & ZEROH & BB H & YB E & PL D200 
8168 118 H#4 & CCEN L & MUX3 & COUNTY 
6161 116 H#3 & 
6 
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AMLOS/29 AMDASM MICRO 


CRT CONTROLLER 


; 
0162 
0164 
0165 
0166 


G1€7 
6168 


@1FO 
@1F@ 


@1Fz 
G1FS 


=e we 


@1F9 
@1F9 


O1FB 
_ @1FB 


01°F 
@1FpD 


Tig HAC 


116 Has 
118 H#8 


ASSEMBLER, V1.1 


& COUNTY & PL D#121 
118 H#4 & CCEN L & MUXS3 & COUNTY 

& 

& 


116 HAC & COUNTY & PL D#15 
116 H#¥A & CCEN H & COUNTY 


ORG R#1F9 


324786 


J10 H#3 & CCEN B & PL S24808 


ORG" H#1F3 
116 H4¥3 & CCEN 


ORG HA1F9 


p24a*E4 
BH & PL S2464E 


5 24¥*32 


118 H#3 & CCEN H & PL S2432E 


ORG H#1F3 
I11@ H#2 & CCEN 


ORG H#1FD 
110 HAS & CCEN 


; 
END “3 


28808 
@681 
ese2 
8283 
6884 
e¢ee5 
OBBE 
8207 
OBGE 
O080S 
OOlA 
O80B 
@26C 
eeeDd 
GOCE 
OGOFr 
8810 
0811 
0012 
8213 
0014 
8015 
2016 
0017 
O81E 
8219 
OO1A 
0818 
@21C 
@81D 
GO1E 
@giFr 
8820 
0821 


XXXXEOSLSXXIXIXXAXX 
@1116118X6001011 
11281100X9801XXX 
11181116X1001XXX 
11600811X11060198 
11802011X1160010 
11800011X1128019 
11002611X1190¢18 
112860011X119001¢ 
118096001X11919080 
11980001X1101091 
1100001111101010 
11060011XXXO11XX 
@1801221X1101XxXxX 
@1118118XX911811 
11061180XXX11XXX 
11080100X1111011 
11680011X1111010 
11601000X1111XXX 
11001160X1111XXX 


11801010X11111Xx 


11081010X11911XxX 
@89XX1810X11011XX 
@1110118X9801611 
11801160XG8001XXxXX 
11121110X1G6G1XXX 
11960011X1180018 
11980801111180019 
118000011X1180018 
1180000811X1180010 
11862881X1101908 
11860€08801X11801601 
110080011X1181018 
11600011XXXO11XX 


XXXIXAXX 
XXXXXXIXX 
62010111 
XXXXIXXY 
822001298 
62600121 
08800119 
08820111 
08881888 
8091191 
08818119 
68801011 
8600611 
8219181 
XXXIXXAXX 
1880186198 
XXXXAXIX 
08018001 
XXXXXXIX 
20010111 
XXXXXAXX 
XXXXXIXAT 
XXXIXXAXXT 
XXXAXXXI 


00810111 - 


XXXXAXXAL 
8811818 
080011011 
08811120 
@0011101 
88102019 
68818118 
88188008 
00811881 


516*32 
BR & PL S1632E 


116*16 
H & PL S1€16E 


2822 
6823 
8824 
8225 
o82€ 
8827 
@22E 
8829 
@82A 
8828 
882C 
o82r 
O82E 
@02F 
0838 
@821 
8832 
6833 
6834 
@835 
O236. 
0837 
6838 
883¢ 
GQ3A 
O83B 
883C 
@83D 
OB3E 
OO3F 
0648 
0041 
0842 
08435 


3323 


CCEN L & MUX1 & COUNTY & PL $ 
, COUNTY 


*@1601001X1161XXX 


61110118X%811611 
1188118@XXX11 XXX 
11808180X1111011 
11098011X111181¢8 
11661606X1111XXxX 
11001100X1111X1X 
11021816X1111112 
21118118X8881811 
1108119@X8001XXX 
11161116X1@01XXX 
11622011X1180018 
11808011X1128018 
1180802012X1181829 
11088081X1181801 
11800811X1181018 
11000013XXXO11IX 
@18801601X1181XXX 
@11121102X811011 
11801160XXX11XXX 
11086108X1111811 
11008011X1111918 
118661000X1111X1X 
11001100X1111xXxx 
1160101@X111111X 
@111011910091911 
11881180X9001XXX 
11101110X1601X1X 
11088011X1100010 
1180001111188018 
11800001X11012008 
11890821X1181001 
11860811X1181810 
11060011XXXB11XX 
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02010101 
XXIXIXXXAX 
81111018 
AXXXIIXIX 
98180116 
XXXXIXIX 
9081111 
IXXIXIIX 
XXXXIXXXX 
g001@111 
XXXIXIXT 
00101101 

00161110 

60110011 

02810110 

09110281 

20101108 

00010101 

XXXIXXIXX 
81001019 
XXXXXXXX 
80116111 

XXXXXXXX 
00010111 

XXXIXXIXT 
XIXXXIXX 
02001111 
XXXXXIIIX 
00111118 
00111111 
81880100 
22010119 
818026819 
99111161 


Semeemeeieinetintee tina tanaeant 


AMDOS/29 AMDASM MICRO ASSEMBLER, Vi.1 
CRT CONTROLLER 


0044 0100100111161XXX 68010161 @11E 11900011X1100018 0006111106 
6045 01110110XXG11011 XAXXXIXIX @11F 11800001X1181890 60100011 
0846 11001160XXX11XIIX 11111818 ‘9128 11906081X1101901 98010110 
0847 118280100X1111011 XXXXXXXX 0121 11600011X1181818 9819¢001 
@046 11002811X1111018 818012000 8122 116800011XXXG11XX 89011819 
6649 11601000X1111XXX XXIXIIIIX @122 €1801001X1101XXX 80016101 
@G4A 11061100X1111xXXX 88118880 8124 91119110XXG811811 XXXXXXXIT 
@G4B 11000196X1111011 XXXXXXIXX 0125 11001100XXX11XXX 11001909 
O84C 110000112%1111818 81991199 @126 11008180X1111011 XXXXIXXIX 
OB4D 11G601CGGX1LLAXXIX XAXIAAXIX @127 11€008011X1111018 20190111 
@@4E 11201100X1111XXX 96001111 8128 11801600X1111XXX XAXXXXXXIX 
BB4F 110861010X11111XX XXXXAXXAIX @12S 11061100X1111XXX 10100111 
@G5O B1118118X8G810811. XXXXXXXX 812A 116001800X1111011 XXXXXXXX 
0851 11091108X@081xXXX 98081111 012B 11800811X1111018 69101011 
@852 11101118X18@1XXX AXXXIAXAX @12C 1196100OX1111XXX XIXXXXXX 
@@52 110808011X1188018 91010011 @12D 11961100X1111XXX 08010111 
2054 11620001X11018e8 91911908 G12E 11081810X11111XX XXXXIXAIX 
@855 110008081X1181001 969190110 @12F ©1110110X0801011 XXXXXXXX 
@856 11080811X1181818 81910119 8138 110801190X6001XXX 96012111 
@@57 11060011XXXG11XX. 818180198 @131 111801110X1@01XXX XAXXXXXAIX 
8058 81081801X1181XIX 69010101 @132 11808011X1108019 60110010 
@859 81110118XX011011 XXXXXXXIX @133 11060011X1100018 08110611 
@O5A 110811G0XXX11XIX 11901911 8134 11000081X1181908 08011198089 
@@5B 11000100X1111011 XXXXIXXXIX 8135 11000601X1191001 80019119 
@25C 110800811X%1111818 91011109 @13E 118600811X1101019 06110110 
@85D 1160188GX1111XXX XAXXXAXXIX @137 11000011XXXG11XX 68118001 
BOSE 11868118011111XXIX 9eeei1i1 8138 8€1901001X1181XXX 68818101 
O@SF 11001010X11111XX XIXXXIIY @139 61110116XX011011 XXXXXXXIYX 
OGL XXXXGGLLXIXXXXAXX eesecee1 G13A 11081108XXX11XIX 11100008 
OOF3 XAXXXGG11XAXXXXLXX 999010111 @13B 11660190X1111011 XXXXIXXXI 
BBFO XXXXGGLIXXXXX1XX 801019198 B13C 11060011X111191@ 981111208 
Q@PB XXXIX@G1IXXIXXAXAUXX 88111011 @13D 11001060X1111XXX XXXIIIIX 
@OFD XXIXGSLIXIIXX1XX G1g1ieesd @13E 11001100X1111XxXX 89010111 
8188 01110118X8881011 XXXXXXIX B13F 11001016X11111XX XAXXXXXAX 
@1@1 11001198xee¢1xXxXX 98010111 0148 @1116118X@001G11 XXXXXXXX 
8182 11121119X1001XXX XXIXXXIIY @141 11061106x@001XXX @2001111 
8162 110628011%1190018 @9000011 @142 1116111GX1@@1XXX XXXXIXXX 
8184 11600011X119¢01e 9oee0100 8143 11900011X1180018 81900011 
9185 11208811X1190019 96900101 0144 11000011X1199010 01000100 
8186 11608011X1196018 ¢20¢00119 0145 11800001X1191908 01001001 
@187 11089011X1100018 90000111° @146 11000001X1101001 209101192 
@108 11080001111916800 ga061190 8147 11000011X1191018 01000111 
@18S 11008001X1191901 99919119 0148 11900011XXX@11XX 81000010 
@1@4 110080112%1191018 99801¢10 0149 91001901X1101XXX 980810181 
@18B 11000811XX1011XX geggeaaie @14A 01110110XXG11011 XXXXXXXII 
B10C 81601001X1101XIX 99010101 @4tB 110@110OXXX11XAXX 111110108 
B18D 61110118XX811011 XxXxXxIIX @14C 11000100X1111011 XXXXXXIXX 
@10E 11605100XXIX11XXX 11981902 @14D 11000011X1111818 81801101 
@10F 116860190X1111911 xxxIxxXX @14E 11601000X1111XXX XXAXXXXIX 
8118 11800011%11112018 aae1g0e0 O14F 11001100X1111XXX 11011111 
@111 11081902X1111XXX XXXIXXXX ‘O15 11606160X1111011 XXXXXXXIX 
8112 119601100X1111XxXX 11101111 8151 118900011X1111818 91819001 
@113 11060190X1111011 XXXXXXIX @152 110801000X1111XXX XXAXXXIXX 
@114 118080011X11119018 92010100 @152 110811060X1111XXX 80801111 
9115 11801000X1111XXX AXIAXITY 8154 11661616X11111XX XAXXXXXXIL 
@11E 1100110902%1111XXX 90010111 @155 891116116X9801011 XXXXXIXXX 
@117 11061610X11111XX XxXXXxITX 8156 11001199X8801XXX 98001111 
0118 6111011610001011 xxxxr1xxXXIx @157 11181110X1061XXX XXXIXXXIXIT 
@119 11801160X80801XIX 99019111 @156 118008011X1189016 8101100898 
@11A 11101116X1G801XXX XXXIXIXT . @15€ 11000001X119016008 @1011101 
811B 11080011X1128801¢8 90011011 @15A 11028081X1181801 8620121108 
@11C 11860011X11600128 890111990 . @15B 11800011X1101818 81811911 
@11D 110080111%1180010 60011101 815C 11080011XXX@11XX 61018111 
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87 


AMIOS/29 AMDASM MICRO ASSEMBLER, V1.1 
CRT CONTROLLER 


@15D 
@15E 
O15F 
81628 
@161 
0162 
@1€3 
0164 
@165 
@166 
G1€7 
8168 
@1FO 
Q1FS 
@1FS 
Q1FS 
@1FD 


ENTRY 


@1001001X1101XXX 
@1110110X%811011 
1180@11@O@XXX11XXX 
11000100X1111611 
118000011X1111818 
1108 1000X1111XXX 
114@1100X1111XXX 
11000100X1111011 
11880011X1111018 
11081000X1111XXX 
11001108X1111XXX 
11081010X11121XX 
XXXXOG1LXAXXAXAXX 
XXXXGO11LXXAXX1XX 
XXXXOOLIXAXXX1XX 
XXXXGO1AXXXIXTAX 
XXXXGO1LXXXXX1XX 


POINTS 


SYMBOLS 


GCBACK e015 
5 


L 


G@21 
22208 


LASTA QB1E 
F1i€16 e252 
MIi€16E @157 


M1E32 @O3D 
M1E€S2E 0142 
M2422 e22c 
ME432E @131 aun 
M2464 8819 


M2464E O1l1lA 


M242 


Cees 


M2486E 8182 


$1€16 


0058 


S1616E @15§ 
$1€32 O23 
$1622E 2148 
$2422 QBZA 
$2432E @12F 


SZ4E€4 


2017 


SEZ4ES6E 0118 


$2468 


8201 


S246@E 8188 


T1616 


eo5e 


T1E16E 815D 


T1622 


O044 


TIEI2E 2149 


T2432 


8033 


T2432E 2138 


T2464 


@Bz2 


T2464E Q12e 


T2482 


Goer 


T2400 E 618C 


TCTAL 


PHASE 2 ERRORS = 


08018101 
XXXXXXXX 
11001088 
XXXXXXIX 


1100001 


XXXXXXXX 
@1111001 
XXXAXXXX 
1180101 
XXXXXXIXX 
@80881111 
XXXXXXIXX 
O8220088 
088110808 
60101111 
81880080 
81610161 - 


) 
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The Microprogrammed CRT Controller was builton a System 29 
universal card and exerc:sed by the System 29 support proces- 


APPENDIX D 


sor. An Am90804 program was wnitten to fill the character mem- 02. 
ory. Figure D1 1s the I:sting o! this program. inorder to observe the 


PROGRAMM TO URITE INTO CHARACTER MEMORY 


BY MOSHE M. SHAVIT 
sREV 0 3/6/78 


, 
O1FF = STACK  EQU AFFH = sSTACK POINTER ; 
OOFF = FAR. EQU OFFH = FIRST ADURESS REGISTER 0/P PORT 
8000 = CHARAD EQU BOOOH CHAKACTER MEMORY STARTS HERE 
0200 : ORG STACK+1 ;UORKING SPACE ABOVE STACK 
0200 FA DS 1 :FIRST ADURESS 
0201 CURAD DS 2 ;CURRENT ADDRRESS 
0203 FIL ps «5 2 ZA(FIRST CHARACTER IN LINED 
a : 
0100 ORG 100H —s- PROGRAM STARTS HERE 
0100 31FFO1 LxI SP STACK 
0103 213087 LXxI H,730H*CHARAD =sLAST LINE, FIKST CHAKACTER 
0106 220302 SHLD FIL sIN “FIRST CHARACTER IN: LINC" BUrFER 
0109 220102 SHLD CURAD AND IN CURRENT ADURESS BUFFER 
019C AF XRA A ;CLEAR A 
O10D D3FF ouT FAR $START ADDRESS#0 
010F 320002 STA FA 3SAVE IN BUFFER 
0112 CD1B01 CALL CLEAR ;CLEAR ALL CHAR. MEMORY 
0115 CD2c01 MAIN CALL  CHARIN ;READ CHARACTER AND PUT IN CHAR. 
0118 C31501 SMP MAIN DO IT AGAIN 
; 
’ 
011B 0600 CLEAR MV B,0 :DATASU : 
011D 210080 LXI H, CHAR :FIRST CHARACTER ADDRESS 
0120 110008 LxI- DB, 2048D ;COUNTER 
0123 70 CLEAR1 MOV H,B ;CLEAR THAT ADDRES 
0124 1B DCX D 3COUNT ; 
0125 23 INX H sNEXT ADDRESS 
0126 7A HOV a,b ;CHECK 
0127 B3 ORA ; IF DONE 
0128 C22301 JNZ. CLEAR1 NO. CONTINUE 
012B CP RET #YES. BACK TO CALLER 
, 
, 
012¢ 0c01 CHARIN MMVI C1 ;CP/M READ CODE 
012E CLOS00 CALLS 3CP/M READ ROUTINE 
0131 FE1A CPI 1AH 3CTL-2? 
0133 CA0DDdO JZ 0 ;RETURN TD CPM IF YES 
0136 200102 LHLD  CURAD =; FETCH CURRENT ADDRESS 
0139 FEOD CPI ODH 3CR? 
013B CAs4o1 JZ CRLF ss YES. 
013E 77 nov HA ;URITE CHARACTER 
013F 23 INX H 3 INCREMENT 
0140 220102 ° SHLD  CURAD ;STORE IN BUFFER 
0143 C9 RET #BACK TO CALLER 
: 
0144 ES CRLF PUSH H 
0145 DS PUSH OD 
0146 CS PUSH —B 
0147 F5 PUSH PSU 
0148 1E0A HVI E,0AH 
014A ‘0E02 VI C,2 
014C CD0500 CALLS 
O14F FL POP PSU 
0150 Ci PoP B 
0151 Di POP Bb 
0152 El PoP H ;ROUTINE TO ECHO LF 
0153 £B XCHG 


sSAVE CURRENT ADDRESS IN DE 


Figure D1 


correct output of the controller, an oscilloscope or CRT monitor 
can be connected through an adaptation 


Circuit shown in Figure 


MEMORY 


0154 
0157 
015A 
0158 
015C 
015E 
O15F 
0160 
0163 
0164 
0165 
0168 


0169 2 


016A 
014D 
016E 
0170 
0172 
0175 
0176 
0178 
017B 
O17E 
0181 
0184 
0187 
0169 
016B 
O16E 
0191 
0193 


0194 
0195 


015000 
2A0302 
oO? 


210080 
220302 
220102 
3A0002 
C4605 
FE76 
cc9401 
320002 
DIFF 
c? 


AF 
c? 


CRLF2 


CRLFS 


CRLF4 


CRLFS 


LXI 
LHLD 
DAD 
XCHG 
MVI 
nov 
CHP 
JNZ 
MoV 
CHP 
Jz 
nov 
INX 
June 
Mov 
ANI 
cpr 
JNZ 
Mov 
CPI 
JNZ 
LXI 
SHLD 
SHLD 
Loa 
ADI 
CPI 
cz 
STA 
ouT 
RET 


XRA 
RET 


A,H 
D 
CRLF 
A,L 


E 
CRLF4 
4,B 

H 
CRLF2 
a,H 


CRLFS 
at 
BOH 
CRLFS 


H,CHARAD 


FIL 


Oscilloscope Connections. 


89 


380 CHARACTERS/LINE 
sFETCH FIRST CH. IN LINE ADDRESS 
gHL= ACNEXT LINE’S FIRST CH. ADD.) 
sHL=CURRENT ADDR. ,DE=ACNEXT LINE FIRST CH. ADDR) 
sDATA=O 
sMORE SIGNIFICANT CURRENT ADDRESS 
s™ASAT LINE FIRST ADDRESS? 
3NO 
#LESS SIGNIFICANT CURRENT ADDRESS 
sIS CURRENT LINE FULL? 
sYES 
sSTORE O AT THAT ADDRESS 
sINCREMENT ADDRESS 
3GO CHECK AGAIN 
sMORE SIGNIFICANT PART OF ADDRESS 
sONLY 3 LESS SIGNIFICANT BITS 
sLAST LINE PASSED? 
sNOT YET 
sLESS SIGNIFICANT BYTE OF ADDRESS 
sARE WE AT 780H=1920D? 
sNOT YET, SKIP 

sYES, START WRITING AT BEGINNING OF CH. HEM. 
sSTORE IN FIRST CH. IN LINE BUFFER 
sAND IN CURRENT ADDRESS BUFFER 
sFETCH FIRST VISIIBLE CHARACTER ADDRESS 
sSCROLL 
3T00 MUCH? 
sYES 
sSTORE IN FIRST ADDRESS BUFFER 
sLOAD REGISTER 
sRETURN TO CALLER 


sFIRST ADDRESS=0 


Figure D1 (Cont) 


Ball Monitor Interface. 


Figure D2. 
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The Data Path 
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INTRODUCTION 


Tho heart of most digital arithmetic processors is the arithmetic 
logic unit (ALU). The ALU can be thought of as a digital subsys- 
tem that performs various arithmetic and logic operations on two 
digital input variables. The Am2901A and Am2903 are Low 
Power Schottky TTL arithmetic logic unit/function generators that 
perform arithmetic/logic operations on two four-bit input vari- 
ablos. in most ALUs, spoed is generally a key ingredient. There- 
fore, as much parallolism in the operation of the arithmetic logic 
unit as possible is desired. 


The Am2901A and Am2903 ALUs are designed to operate with 
an Am2902A carry lookahead generator to perform multi-tevel full 
carry lookahead over any number of bits. Therefore, the devices 
have both the carry generale and carry propagate outputs re- 
quirad by the Am2902A carry lookahead generator. The devices 
also have the carry output (C,44)'and a two's complement over- 
flow detection signal (OVR) available at the output. The net result 
Is that a very high-speed 16-bit arithmetic logic unit/function 
generator can be designed and assembled using four of these bit 
slice devices and one Am2902A (the Am2902A is a high-speed 
version of the ‘162 carry lookahead generator). In addition, the 
Am2901A and Am2903 provide a minimum of 16 working regis- 
ters for providing source operands to the ALU. 


UNDERSTANDING THE BASIC FULL ADDER 


The results of an arithmetic operation In any position in a word 
depends not only on the two-input operand bits at that position, 
but.aiso on all the lessar significant operand bits of the two input 
variables. The final resuit for any bit, therefore, is not available 
until the carries of all the previous bits have rippled through the 
logic array starting from the least significant bit and propagating 
through to the most significant bit. A full adder is a device that 
accepts two individual operand bits at the same binary weight, 
and also accepts a carry input bit from the next lesser significant 
weight full adder. The full adder then produces the sum bit for this 
bit position and also produces a carry bit to be used in the next 
more significant weight full adder carry input. The truth table for a 
full adder is shown in Figure 1. From this truth table, the equations 
for the full adder: 


S*=A@BOC 
Cy = AB + BC + AC, 


where A and B are the input operands to the full adder and C 
is the carry input into the adder. 
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Figure 1. Full Adder Truth Tabie. 


The sum output, S, represents the sum of the A and B operand 
inputs and the carry input. The carry output, Co, represents the 
carry out of this cell and can be used in the next more significant 
cell of the adder. Full adder celis can be cascaded as depicted in 
Figure 2 to form a four-bit ripple cany parallel adder. 


Note that once we have cascaded devices as shown in Figure 2, 
we may wish to discuss the equations for the i-th bit of the adder. 
in so doing, we might describe the equations of the full adder as 
follows: 


S; =A, ® B® C, 

Ci4, = AiBj + BIC) + AC; 

where the A; and B, are the input operands at the I-th bit, and 

the C, is the carry input to the /-th bit. (Note that the equa- ~ 

tions for this adder are iterative in nature and each depends 

on the result of the previous lesser significant bits of the 

adder array.) : ‘ 
The connection scheme shown in Figure 2 requires a ripple 
Propagation time through each full adder cell. Ha 16-bit adder is to 
be assembied, the carry will have to propagate through ail 16 full 
adder cells. Whatis desired is some technique for anticipating the 
carry such that we will not have to weit for a ripple ipary to 
propagate through the entire network By using some additional 
logic, such an adder array can be constructed. This type of adder 
Is usually called a carry lookahead adder. \ 


Figure 2. Cascaded Full Adder Celis Connected as a Four-Bit Ripple-Carry Full Adder. 
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A FOUR-BIT CARRY LOOKAHEAD ADDER 


Looking back to the equations developed for i-th bit of an adder, 
tet us now rewrite the carry equation in a shghtly different form. 
When we factor the C; in this equabon, the new equaton be- 
comes: 


Cia = AB, + C,(A, + Bi)- 


From the above equation, let us now define two additional aqua- 
tons. These are: 


G, = AB; 

Pie A +B; % 

” With these two new auniliary equations, we can now rewrite the 
Carry equahon for the Hh bit as follows: 


Chay = G, + PIC, 


Note that we have now developed two terms: the P, term is 
known as cary propagste and the G, term is known as carry 
generates. An anticipated carry can be generated at any stage of 
the adder by imp'ementing the above.equa!ions and using the 
@unubary functions P; and G, as required. 

Rt Is interesting to note that the sum equation can also be 
written in terms of these two auxiliary equations, P; and G;. For 
this case, the equabon is: 


Sj = (A + BAB) © C; 


The auxitary function G, is called carry generate, because if itis 
true, then a carry is immediately produced for the next accsr 
Stage. The tunction P, is called carry propagate because it implies 
there wili be a carry into the next stage of the eddar if there is a 
Carry inte this stage of the adder. Thatis, G;, causes a carry stanal 
at the rth stage of the adder to be generated and presented to the 
next stage of the adder whie P, causes an existing carry at the 


input lo the -th stage of the adder to propagate to the next staguof 
the acder. 


Let us now write ail of the sum and carry equations required fora 
full four-bit lookahead carry adder. 


So = A@ By® Cy 

S; = Ar@® B, @® (Sp + PoCo} 

S2 tA A2@ B8 (G, aa P,Gg + P PoC) 

S3 = A3® By @ (G, + PyG, + P2P,Gp + P2P;PoCq) 
Ci4g = G; + P3G2 + P3P2G, v P5P2P,Go +r P5P2P.P Cg 


An impoctant point to note is that ALL of the sum equations and 
the final carry output equation, C;, 4, can be written in terms of:he 
A;, B), and Co inputs to the four-bit adder. The configuration as 
described above is shown in Figure 3. This figure is divided intc 
two parts — the upper blocks show the auxiliary function 
generator circuitry required to impsement the P, and G; equations 
while the lower block impiements the log:c required to generale 
the sum output at each bit position. : 


A serious drawback to the lookahead cary adder is that ae the 
word length is increased, the carry funcrons ce ore 9:3 « and 
more complex, eventually becoin'ng imp-actical uu? to tts lerce 
number of interconnections and heavy oading of the G; ano 7, 
functions. The auxiliary function concept can be extended, how- 
ever, by dividing the word length into fairy sma incramaris arc 
Gefining blocks of auxiliary functions G and PF. 


Ris possible for a given block to dufine a funct-on G as the carry 
out generated with the block; and P can be dafinec as tre ca ty 
propagate over the biock. If the block size is set at four bits, then 
the functions for G and P for this block can be aei.ned ac fc.lows: 


G = Gy + P,G2 + PsP2G, + P3P2P;Gp 
Pp = P5P2P Po 


Figure 3. Full Four-Bit Carry-Lookahead Adder. 


It is important to necro that neither of these terms involves a 
carry-in (Co) to the b‘ock, so no matter how meny biocks are tied 
in an adder, all the p'ocxs have stable G and P functions available 
in a minimuna number of gate delays. 


The G and P functicns can be gated to produce a carry-in to each 
four-bit biock, 6s a tuncuco of the lesser significant blocks. The 
cerry-in to 2 block ts there‘ore: | 


Ca 3 Gye. = P,-,G.-2 + Pp-1Pp-2Gn-3 + nee 
a Pyaar 2Pr = oa P2PyP Co . 
Finally, the carry-in to each of the bits in a four-bit block must 
include-a te*m for the actual least significant carry-in; note, 
therefo:e, thai the equations for the four-bit full adder presented 
above inciude a term for carry-in at edch bit position. ~ 


Figure 4 shows the technique for cascading typical bit slice ALUs 
such as 'n2 Am29014 or Am2903 and one Am2902A in a full 
16-bit high-speed carry ‘vokahezd connection. Figure 5 shows 8 
connection sceme usin2 on. four bit slices In a 16-bit arithmetc 
gic urit conacction whee the camies are rippled between the 
devices. Each ot sce dues Use intemal carry lookahead over the 
our Block. 
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- Figure 4. Full Lookahead Carry 16-Bit Adder. 


or Az Az. By 01 87 83 AaAsAgAy 048s 0g 8; 


Oo 81 02 By 


AoA Az Ay 8 8, 87 By ex Panne Ao 
Case mara Coma 
01 


ann 


ce 


AaAsAroAn Bs S808 AraArzAraArg 812813810815 


Ao AVAD AS Oo 8; 83 03 mo At Az #3808107 03 | Cour 
Se Cues Cone 


a 


Figure 5S. Connection of 16-Bit ALU Using Ripple Carry. 


in summary, the ripple carry method can be used In conjunction 
with the lookahead technique in several ways. ‘ 


1. Lookahead carry over sections of the adder and ripple carry 
between these sections of the adder can be used. This 
method is often the most efficient in terms of hardware for a 
given speed requirement. It does not require the use of a 
lookahead carry generator such as the Am2902A. 

2. Lookahead carry across 16-bit blocks with a ripple carry be- 
tween 16-bit blocks can be used. This technique ts usually 
called two-level carry lookahead addition. This technique re- 
sults in very high-speed arithmetic tunction generation and 
makes a reasonable tradeoff between the speed and 
hardware for word lengths greater than 16 bits. 

Full lookahead carry across all levels and all block sizes can 
be used. This Is the highest speed arithmetic togic unit con- 
nection scheme. For word sizes up to 64 bits, itis referred to as 
three-level lookahead carry addition: Such a 64-bit ALU re- 
quires the use of five Am2902A carry lookahead generator 
units in addition to the 16 bit slice ALU devices as shown in 
Figure 6. 


OVERFLOW 


When two's complement numbers are added or subtracted, the 
result must lie within the range of the numbers that can be 
handled by the operand word length. Numbers are normally 
represented either as fractions with a binary point between the 
sign bit and the rest of the word, or as integers where the binary 
point is after the least significant bit. The actual choice for the 
location of the binary point is really up to the design engineer, as 


the hardware configuration required for either technique is identi- 
cal. itis also possible to use number notations that include both 
Integer and fractional representations in the same numbering 
scheme. Overflow is defined as the situation in which the result of 
an arithmetic operation lies outside of the number range that can 
be represented by the number of bits in the word. For example, if 
two eight-bit numbers are added and the result does not lie within 
the number range that can be represented by an eight-bit word, 


‘we say that an overflow has occurred. This can happen at either 


the positive end of the number range or‘at the negative end of the 

number range. The logic function that indicates that the result of 

an operation is outside of the representable number range is: 
OVR = Co, @ Cgi, 


~ where CG, is the cary-in to the sign bit and C,,, Is the | 
carry-out of the sign bit 


Thus, for a four-bit ALU with the sign bit in the most significant bit 


position, the two's complement overtlow can be defined as the 
Cn+4 tarm-exciusive OR’ed with the C,45 term. 


Putting the ALU In the Data Path of a Simple Computer 


Once the Design Engineer understands the basic configuration 
and operation of a simple high speed carry lookahead adder, he 
can begin to understand the configuration required to Implement 
the data handling section of a typical computing machine. The 
simplest architecture for the data handling path of a minicomputer 
is shown in Figure 7. Here, an accumulator is used in conjunction 
with an ALU to perform a basic arithmetic/storage capability tor 
data handling. The computer contro! unit of Figure 7 can be a 
simple or sophisticated state machine as described in Chapter 2. 
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Figure 7. Basic Computer Data Path. 


While the introductory material of this chapter concentrated on full 
adders, it should be understood that more ALU functions than 
addition are required if we are in to implement the data path of a 
typical minicomputer. Typically, some or all of the functions 
shown in Figure & are needed if we are to implement a powerful 
data handing capability. 


The operabon of the ALU/accumuiator configuration shown in 
Figure 7 can be described as follows. The accumulator can be 
so2zded by bringing dzla in from the data-in port through the A 
input o! the ALU, passed through the ALU and loaded into the 
accumulator. A second word of duta can be presented at the 
data-in port to the A input of the ALU and tite ALU can be used to 
pe torm an operation such as A + B,AORB, AANDB,A - Band 
se forth. The rasults of this ALU operation can then be placed into 
the accu ulalor, The accumulator output is available at the 
Oata-out >t !0" use eisewhere. Additional ALU functions such as 
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Figure 8. Basic ALU Instructions. 


those shown In Figure 8 are easily implemented by adding some 
additional! circuitry to the four-bit carry look ahead adder shown in 
Figure 3. If this circuitry is added, we will arrive at a logic diagram 
as shown in Figure 9. This diagram certainly is familiar to most 
CPU designers and is the well known Am74S181 four-bit arith- 
melic logic unit/function generator. 


Once the operation of the simple computer data path as shown in 
Figure 7 is understood, the Design Engineer will soon recognize 


.the need for additiona! registers if our machine is to be general 


purpose and execute instructions. Very rapidly the need arises for 
a register to hold a program counter (PC) and a memory address 
register (MAR). The purpose of the program counter Is to pointto 
the address of the next instruction in main memory. Typically itis 
loaded into the memory address register which actually provides 
the address on to the address bus of the machine. Then, the 
program counter is incremented through the ALU and stored untl 
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Figure 10.Three Register Computer Data Path. 


itis needed again. The block diagram of Figure 10 shows these 
additional registers connected in parallel at the output of the ALU. 
This ALU output is called the F bus. Each of these registers (the 
accumulator, the PC, and the MAR) has an enable Input from the 
CCU.so that they can selectively be loaded with data from the 
ALU. In addition, each of these registers has an.output enable 
such that they can be selectively enabled onto the D bus. The D 
bus represents the data output path from the basic computer data 


path and also is used as one of the inputs to the actual ALU/func- 
ton generator. The other input in this example is called the R bus 
and comes directly from the main memory data output as well as 
from the VO data Input. As shown in Figure 10, the memory 
address register (MAR) has a second ouiput that is used to drive 
the address bus. In this example, this register always contains the 
address to be applied to the external memory whether it be the 
address of data or the address of an instruction. 
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The best way to understand the operation of this single AL U/three 
Tegister machine is to take an example. Let us assume we have 
just completed the execution of ona machine instruction and are 
teary to fetch the next instruction. The first operation would be to 
transfer the current value of the program counter onto the D bus 
through the ALU onto the F bus and into the memory address 
register. This right be accomplished during one microcycle. The 
Second operation might be to.again put the PC on the D bus, pass 
it through the ALU 8 port and increment the value at the B port and 
reload it into the PC register. Thus, the PC has again been 
updated to point to the address of the next intruction. During this 
time, the address from the MAR ts on the address bus and we are 
fetching data from the extemal memory and placing if on the R 
Sus. The third microcycle would be to bring the data out of the 
extemal memory and pass itto the Instruction register in the CCU. 
The next microcycle might be to decode this instruction and 
determine that the next word after the current instruction in mem- 
Ory (an immediate operation) Is to be added to the value currently 
in the accumulator. Thus, we would again need to place the PC 
into the MAR on one cycie and then incremant the PC on the next 
cycle. Following this, te data from the extemal memory could be 


brought to the R bus through the A port of the ALU and added to ~ 


the accumulator value which is placed on the D bus and brought 
through the B port of the ALU. The resutt would be placed in the 


accumulator. This operation would complete the example and we _ 


would be ready to fetch the next instruction. As can be seen, a 


number of microcycles are required to fetch the instruction, de- | ~ 


code it, fetch the data and execute the Instruction. One of the best 
ways to understand the flow needed to implement a typical in- 
struction set is shown in Figure 11. Here, we see the basic 
instruction fetch and decode operation followed by the path used 
{0 execute each of the various instructions. Then, we see a retum 
to the fetch operation to fetch the next instruction. 


Certainly from this discussion we can see how three registers 
have enhanced the performance ol the simple ALU/accumulator 
Gata path shown in Figure 7. Typically, even more registers than 
shown in Figure 10 are needed if we are to increase the power of 


Figure 11. Steps for ADD Instruction. 


our machine. If we examine the block diagram of Figure 12, we 
se6 a similar architecture to that as shown in Figure 10. Here, the 
number of working registers has been expanded to sixteen at the 
output of the ALU. These can be used to provide a program 
counter function and a number of accumulator functions simul- 
taneously. In addition, note that the registers have two output 
ports such that the simultaneous selection of any two of the 
sixteen registers is possible. Both of these registers can be pre- 
sented to the ALU so that operations on ¢wo registers simullane- 
ously can be executed. In addition, a data input multiplexer is 
€vailable at the A port of the ALU such that extemal data can be 
brought in to the configuration. Likewise, there is an output mutt- 
plexer such that either the A output of the registers or the ALU 
output can be selected. This output multiplexer is used to provide 
@ data out port and the output can also be loaded into memory 
address register to provide an address as required. Thus, the 
architecture of Figure 12 is quite similar to that of Figure 10 except 
that the number of registers has been increased to provide add+ 
tional flexibility. : — 

If we assume that one of the sixteen registers inside of this 
register fila is to be used as the program counter, we see that the 
Program counter can ba brought out of the A output port and 
loaded into the memory address register and at the same time # 
can also be brought out the B output port and incremented in ALU 
and raloadad into the register file. In this architecture it appears 
the A output of the register stack can also be brought to the input 
multiplexer and the A port of the ALU and incremented via that 
path and reloaded into the registers. While this is possible in the 
architecture of Figure 12, we are leading up to the implementation 
of an Am2901A and this path is not needed in the Am2901A. 
Thus, we can implement functions and operations in the diagram 
of Figure 12 just as we could in the diagram of Figure 10. How- 
ever, what was previously performed in two microcycles can now 
be performed in one microcycie. That is, tha MAR can be loaded 
with the current value of the PC and at the same time the PC can 
be incremented and the new value restored in the PC register. 


Figure 12. Multl-Reglster ALU. 


Another foature of the block diagram of Figure 12 is the depiction 
of the carry in bit to the ALU and the four output flags associated 
with the ALU. Here, Carry in is the normal carry in as needed in 

, any adder such that the device is cascadable. In additicn, certain 
kinds of anthmetic functions such two's complement arithmetic 
also need the ability to provide a carry in for certain operations. 
The most common is two's complement subtract which is usually 
performed by complementing the operand to be subtracted, ad- 
ding and adding one at the carry In. Also, the ALU shows the four 
Output fiags usually associated with a typical minicomputer. 
These are the carry output, the sign bit, the overflow detect, and 
the zero detect. These four status fiags are used to determine 
various things about the operation being performed. The carry out 
flag and overflow flag are as descnbed in the previous sections of 
this chapter. They provide the carry and overflow information 
about the addition. 


The sign bit is simply the most significant bit of the ALU and 
represents the sign of a two's complement number. That is, when 
the sign bit is LOW, we assume the two's complement number is 
positive and when the sign bit is HiGH, we assume the two's 
complement number is negative. Thus, the sign bitis active HIGH 
and caries negative weight as we assume in any standard two's 
complement number representation. if the reader is unfamiliar 
with two's complement number notations, a discussion of this 
| topic can be found in an application note entitled “The Am25S05, 
Am2505 and Am25L05 Schottky, Standard and Low Power TTL 
Two's Comptement Digital Multipliers” as found in Advanced 
Micro Devices’ Schottky and Low Power Schottky Data Book 
dated 10/77. This application note begins on page 5-49 and fully 
details two's comp!ement numbar notation and gives examples. 


The fourth status flag is ca'led the zero flag and again is just what 
the name implies. This flag represents the fact that ail of the ALU 
outputs are at logic zero. in this design, a logic zero means that ail 
of the ALU output bits are LOW. 


If the architecture of Figure 12 Is extended a little more, we will 
arrive at the Am2901A as depicted in Figure 13. Here, we have 
redrawn the structure so that the registers are placed above the 
ALU; however, the function is identical. Two new functions have 
been added to this block diagram that have not previously been 
discussed. These are the RAM shift matrix located directly above 
the sixteen registers now described as a 16 x 4 dual port RAM. 
The purpose of the RAM shift network is to allow the ability of 
shifting the data word to be written into the register either up one 
bit position or down one bit position. The second function added 
to the block diagram is that of the Q register and shift network. 
Here, the Q register Is used as an auxiliary register such that 


double tength operations can be performed and itis alsousedin | 


the multiply and divide algorithms. In addition, the shift network 
allows the Q register contents to be shifted up one bit position or 
shifted down one bit position. In addition, it should be pointed out 
that the memory address registeris not part ofthe Am2901A. This 
is because there were not enough pins on the package to imple- 
ment tho function and the additional power required by the output 
butfers would have reduced the performance of the ALU and 
register stack. Instead, this function is baing designed into other 
2900 family products. 


Am29801A ARCHITECTURE 


A detailed block diagram of the Am2901A bipolar micropro- 
grammable microprocessor structure is shown in Figure 14. The 
Circuit Is a four-bit slice cascadable to any number of bits. There- 
fore, alt data paths within the circuit are four bits wide. The two key 
elements in the Figure 14 block diagram ara the 16-word by 4-bit 
2-port RAM and the high-speed ALU. 
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Figure 13. Am2901A Biock Diagram. 


Data in any of the 16 words of the Random Access Memory 
(RAM) can be read from the A-port of the RAM as controlled by 
the 4-bit A address fied input. Likewise, data in any of the 16 
words of the RAM as defined by the 8 address field input can be 
simultaneously read from the B-port of the RAM. The same code 
can be applied to the A select field and B select field in which case 
the identical file data will appear at both the RAM A-port 
B-port outputs simultaneously. | 


When enabled by the RAM write enable (RAM EN), new data is 
always written into the filo (word) defined by the B address field of 


_ the RAM. The RAM data input field is driven by a 3-input multi- 


plexer. This configuration is used to shift the ALU output data (F) If 
desired. This three-input muitiplexer scheme allows the data to 
be shifted up one bit position, shifted down one bit position, or not 
shifted in either direction. 

The RAM A-port data outputs and RAM B-port data outputs drive 
separate 4-bit latches. These latches hold the RAM data while the 
clock input is LOW. This eliminates any possible race conditions 
that could occur while new data fs being written into the RAM. 


The high-speed Arithmatic Logic Unit (ALU) can perform three 
binary arithmetic and five logic operations on the two 4-bit Input 
words R and S. The R input field is driven from a 2-input multi- 
plexer, while the S input field is driven from a 3-input multiplexer. 
Both multiplexers also have an inhibit capability; thatis, no data is 
passed. This is equivalent to a “zero” source operand. 

Referring to Figure 14, the ALU R-input multiplexer has the RAM 
A-port and the direct data inputs (D) connected as inputs. 
Likewise, the ALU S-injut multiptexer has the RAM A-port, the 


_ RAM B-port and the Q register connected as inputs. 
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This multiplexer scheme gives the capability of selecting various 
pairs of the A, B, D, OQ and “0” inputs as source operands to the 
ALU. These five inputs, when taken two at a time, result in ten 
possible combinations of source operand pairs. These combina- 
tons include AB, AD, AQ, AO, BD, BQ, 80, DQ, DO and QO. Itis 
apparent that AD, AQ and AO are somewhat redundant with BD, 
BQ and BQ in that if the-A address and B address are the same, 
the identical function results. Thus, there are only seven com- 
pletety non-¢edundant source operand pairs for the ALU. The 
Am2901A microprocessor-implements eight of these pairs. The 
microinstruation, inputs used to select the ALU source operands 
are the Ig. fj and lz inputs. 


The two goutce operands not fully described as yet are the D input 
and Qinput. The D input is the four-bit wide direct data field input. 
This port is used to insert all data into the working registers inside 
the device. Likewise, this input can be used in the ALU to modify 
any of the Mhternal data files. The Q register is a separate 4-bit file 
‘intended primarily for multiplication and division routines but it 
can also be used as an accumulator or holding register for some 
applications. 

The ALU itself is a high-speed arithmetic/ogic operator capable 
of performing three binary arithmetic and five logic functions. The 
ls, I, and [5 microinstruction inputs are used to select the ALU 
function. The definition of these tunctions is shown in Figure 15. 
The norma! technique for cascading the ALU of severa! devices is 
in a look-ahead carry mode. Carry generate, G, and carry propa- 
gate, P, are outputs of the device for use with a cary-look- 
ahead-generator such as the Am2802A ('182). Acarry-out, Cr+¢, 
ts also generated and is available as an output for use as the carry 
flag in a status register. Both carry-in (C,) and carry-out (Cp+4) 
@re acive HIGH. 
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Figure 15. Am2901A Microinstruction Control. 


Tne ALU has three other status-oriented outputs. These are F3, F 
= 0, and overflow (OVR). The F3 output Is the most significant 
(sign) bit of the ALU and can be used to determine positive or 
negative results without enabling the three-state data outputs. Fy 
Is non-inverted with respect to the sign bit output Y3. The F = 0 
output is used for zero detect. It is an open-collector output and 
can be wire OR'ed between microprocessor slices. F = Ols HIGH 
when all F outputs are LOW. The overflow output (OVA) is used to 
flag arithmetic operations that exceed the available two's com- 
plement number range. The overflow output (OVA) is HIGH when 
overflow exists; that Is, when C,43 and Cy,4 are not the same 


polarity. 


The ALU data output is routed to several destinations. Itcan be a 
data output of the device and it can also be stored in the RAM or 
the Q register. Eight possible combinations of ALU destination 
functions are available as defined by the ',, ly and I, micro- 
instruction inputs. These combinations are shown in Figure 15. 


The four-bit data output field (Y) features three-state outputs and 
can be directly bus organized. An output control (OE) is used to 


. enable the three-state outputs. When OE Is HIGH, the Y outputs 


are in the high-impedance state. 


A two-input multiplexer is also used at the data output such that 
either the A-port of the RAM or the ALU outputs (F) are selected at 
the device Y outputs. This selection is controlled by the Ig, ly and 
Ig microinstruction inputs. 

As was discussed previously, the RAM inputs are driven from a 
three-input multiplexer. This allows the ALU outputs to be entered 
non-shifted, shifted up one position (X2) or shifted down one 
position (+2). The shifter has two ports; one is labeled RAM, and 
the other is labeled RAM3. Both of these ports consist of a 
butfer-driver with a three-state output and an input to the multi- 
plexer. Thus, in the shift up mode, the RAM; bufferis enabled and 
the RAM, multiplexer input is enabled. Likewise, in the shift down 
mode, the RAM, buffer and RAM input are enabled. In the 
fo-shift mode, both buffers are in the high-impedance state and 
the multiplexer inputs are not selected. This shifter is controlied 
from the Ig, ly and tg microinstruction inputs. 


Similarly, the Q register ts driven from a 3-input multiplexer. In the 
no-shift mode, the multiplexer enters the ALU data into the Q 
register. in either the shift-up or shift-down mode, the multiplexer 
selects the Q register data appropriately shifted up or down. The 
Q shitter alsd has two ports; one is labeled Qy and the other is Q). 
The operation of these two ports is similar to the RAM shifter and 
is also controlled from ig, ly and ig. 


The clock input to the Am2901A controls the RAM, the Q register, 
and the A and B data latches. When enabled, data is clocked into 
the Q register on the LOW-to-HIGH transition of the clock. When 
the clock input is HIGH, the A and B latches are open and will pass 
whatever data is present at the RAM outputs. When the clock 
input is LOW, the latches are closed and will retain the last data 
entered. If the RAM-EN is enabled, new data will be written into 
the RAM file (word) defined by the B address field when the clock 
input is LOW. 


Am2903 GENERAL DESCRIPTION 


The Am2903 is a four-bit expandable bipolar microprocessor 
slice that performs all functions performed by the industry stan- 
dard Am2901A. In addition, it provides a number of significant 
enhancements that are especially useful in arithmetic oriented 
Processors. The Am2903 contains sixteen intemal working re- 
gisters arrangedin atwo address architecture and it also provides 
all of the necessary signals to expand the register file externally 
using the Am29705 register stack. Any number of ragisters can 
be cascaded to the Am2903 using this technique. In addition to its 
complete arithmetic and logic instruction set, the Am2903 pro- 
vides a special set of instructions which facilitate the implementa- 
tion of multiplication, division, normalization and other previously 
time consuming operations such as parity generaton and sign 
extension. A block diagram of the Am2903 is shown in Figure 16. 


ARCHITECTURE OF THE Am2903 


The Am2903 is a high-performance, cascadable, tour-bit b polar 
microprocessor slice designed for use in CF -’s, peripheral con- 
trollers, microprogrammable machines, and nu..-erous othu: ap 
plications. The micro:nstructon flexibility of the Am2903 alicws 
the efficient emulatica of almost any digital computing moch:n=: 
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Figure 16. Basic Am2903 Block Diagram. 


The nine-bit microinstruction selects the ALU sources, function, 
and destination. The Am2903 Is cascadable with full lookahead or 
rippie carry, has three-state outputs, and provides various ALU 
Status flag outputs. Advanced Low-Power Schottky processing Is 
used to fabricate this 48-pin LSI circuit. 


All data paths within the device are four bits wide. As shown in the 
block diagram of Figure 16, the device consists of a 16-word by 
4-bil, two-port RAM with latches on both autput ports, a high-per- 
formance ALU and shifter, a multi-purpose Q Register with shifter 
input, and a nine-bit instruction decoder. 


Two-Port RAM . 


Any two RAM words addressed at the A and B address ports can 
be read simultaneously at the respective RAM A and B output 
ports. identical data appear at the two output ports when the 
sume address is applied to both address ports. The latches at the 
RAM output ports are transparent when the clock input, CP, is 
HIGH and they hold the RAM output data when CP is LOW. Under 
contol of the OEg three-state output enable, RAM data can be 
rezd directly at the Am2903 DB V/O port. 


External data at the Am2903 Y I/O port can be written directly into 
the RAM, or ALU shifter output data can be enabled onto the Y VO 
port and entered Into the RAM. Data is written into the RAM at the 
B address when the write enable input, WE, Is LOW and the clock 
input, CP, Is LOW. 


Arithmetic Logic Unit 


The Am2903 high-performance ALU can perform seven arithme- 
tic and nine logic operations on two 4-bit operands. Multiplexers 
at the ALU inputs provide the capability to select various pairs of 
ALU source operands. The E, input selects either the DA exter- 
nal data Input or RAM output port A for use as one ALU operand 
and the O€g and ly inputs select RAM output port B, DB extemal 
data input, or the Q Register content for use as the second ALU 
operand. Also, during some ALU operations, zeros are forced at 
the ALU operand inputs. Thus, the Am2903 ALU can operate on 
data from two extemal sources, from an intemal and extemal 
source, or from two intemal sources. 


When Instruction bits Ig, 'g, lo, ly and Ig are LOW, the Am2903 
executes special functions. Figure 17 defines these special func- 
tions and the operation which the ALU performs for each. When 
the Am2903 executes instructions other than the nine special 
functions, the ALU operation is determined by instruction bits I4, 
ly, lp and t;. Figure 18 defines the ALU operation as a function of 
these four instruction bits. 


Am2903s may be cascaded in either a ripple carry or lookahead 
carry Yashion. When a number of Am2903s are cascaded, each 
slice must be programmed to be a most significant slice (MSS), 
Intermediate slice (IS), or least significant slice (LSS) of the arvay. 
The carry generate, G, and carry propagate, P, signals required 
fora lookahead carry scheme are generated by the Am2903 and 
are available as ontputs of the least significant and intermediate 
slices. 

The Am2903 also generates a carry-out signal, C,,44, which is 
generally available as an output of each slice. Both the carry-in, 
Cy. and carry-out, Ca44, signals are active HIGH. The ALU 
generates two other status outputs. These are negative, N, and 
overtiow, OVR. The N output is generally the most significant 
(sign) bit of the ALU output and can be used to determine positive 
or negative results. The OVA output indicates that the arithmetic 
Operation being performed exceeds the available two's comple- 
ment number range. The N and OVA signals are available as 
outputs of the most significant slice. Thus, the multi-purpose GIN 
and P/OVR outputs Indicate G and P at the least significant and 
intermediate slices, and sign and overflow at the most significant 
slice. To some extent, the meaning of the C,,4.4, P/OVA, and G/N 
signals vary with the ALU function being performed. 


~ 


ALU Shitter 


Under instruction control, the ALU shifter passes the ALU output 
(F) non-shifted, shifts it up one bit position (2F), or shifts it down 
one bit position (F/2). Both arithmetic and logical shift operations 
are possible, An anthmetic shift operation shifts data around-the 
most significant (sign) bit position of the most significant slice, and 
a logical shift operation shifts data through this bit position (see 
Figure 19). SIOy and SIO; are bidirectional serial shift inputs/out- 
puts. During a shift-up operation, SIQg is generally a serial shift 
input and SIO; a serial shift output. During a shift-down operation, 
S10, is generally a serial shitt input and SiOg a serial shift output. 


The ALU shifter also provides the capability to sign extend at slice 
boundaries. Under instruction control, the SIO (sign) inpui can 
be extended through Yo, Yy, Y2, Y3 and propagated to the SIO, 
output. 


F-S+C, #ZeL 
F-5+Ca nh Z=H 
F-SeC,#zl 
FeS-A-1+C,¢Z=H 


FeS+AeC, Zab 
FaS-A-1¢+C,dZ=H 


FuS*AeC, fZ=L 
FeS-A-19C,dZeH 


NOTES: 1. Al the most significant pica only, tha Cy, 4 sgnal ta neemally galed to the Y; output 
2 At the moat seonficant shce only. F; V OVA @ mternally gated to the Vy output 
3. Al the most significant slice only, S) VF 3 is generated at the Ys output. 


4. Op coces 1,3. 7, 9. B, 0. and F are reserved for future use. 
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Figure 18. ALU Functions. 


A cascadable, five-bit parity generator/checker Is designed Into 
the Am2903 ALU shifter and provides ALU error detection capa- 
bility, Parity for the Fo, Fy, Fz, Fy ALU outputs and SIO; input is 
generated and, under instruction control, is made available at the 
S!Op output. 


l=Low 
H = HIGH 
X = Dont Care 


Hi-Z = High lnpedance 
VW» Exciut-ve OR , 
Panty = SIO3 ¥ Fy ¥ Fa VF, ¥ Fa 
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Figure 19. 


The instruction inputs determine the ALU shifter operation. Figure 
17 defines tha special functions and the operation the ALU shifter 
performs for each. When the Am29C3 executes instructions other 
than the nine special functions, the ALU shifter operation is de- 
termined by instruction bits Igl7igis. Figure 20 defines the ALU 
shifter operation as a function of these four bits. 


Q Register - 


The Q Register is an auxiliary four-bit register which is clocked on 
the LOW-to-HIGH transition of the CP input. Itis intended primar- 
ity for use in multiplication and division operations: however, itcan 
also be used as an accumulator or holding register for some 
applications. The ALU output, F, cum be loaded into the Q Regis- 
ter, and/or the Q Register can be selected as the source for the 
ALU S operand. The shifter at the input to the Q Register provides 
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Figure 20b. Am2903 ALU Destination Control Summary. 


the capabiltty to shift the Q Regisfer contents up one bit’ position 
(2Q) or down one bit position (Q/2). Only logical shifts are per- 
formed. QlOp and QI0; are bidirectional shift seria! Inputs/out- 
puts. During a Q Register shitt-up operation, Q!Og Is a serial shift 
Input and QIO; is a serial shift output. During a shift-down opera- 
tion, Q/O3 is a serial shift input and QiOg is a serial shift output. 
Doubdle-length arithmetic and logical shifting capability Is pro- 
vided by the Am2903. The double-length shift Is performed by 
Connecting QIO; of the most significant slice to 610g of the jeast 
significant slice, and executing an inslruction which shifts both the 
ALU output and the Q Register. 


The Q Register and shifter operation is controlled by instruction 
bits Iglzlglg. Figures 17 and 20 define the Q Register and shifter 
operation as a function of these four bits. 


Output Butfers 


Tha DB and Y ports are bidirectional /O ports driven by three- 
state output buffers with extemal output enable controls. The ¥ 
output buffers are enabled when the OFy inputis LOW and arein 
the high-impedance state when OEy is HIGH. Likewise, the DB 
output butfers are enabled when the OE g input is LOW andin the 
high-impedance state when OE, is HIGH. 


The zero, Z, pin is an open collector input/output that can be 
wire-OR’ed between slices. As an output it can be used as a zero 
detect status flag and generally indicates that the Yo.3 pins are all 
LOW, whether they are driven from the Y output buffers or from an 
extemal source connected to the Yp.3 pins. To some extent the 
meaning of this signal varies with the instruction being performed. 


Instruction Decoder 


The instruction Decoder generates required intemal control sig- 
nals as a function of the nine Instruction inputs, Io.9; the Instruc- 
tion Enable input, IEN; the LSS input; and the WRITE/MSS in- 
puvoutput. The WRITE output is LOW when an instruction which 
writes data into the RAM is being exeeuted. 


When TEN Is LOW, the WRITE output is enabled and the Q 
Register and Sign Compare Flip-Flop can be written according to 
the Am2903 instruction. The Sign Compare Flip-Flop is an on- 
chip flip-flop which is used during an Am2903 divide operation. 


Prograraming the Am2903 Slice Positlon 


Tying the LSS input LOW programs the slice to operate asa least 
significant slice (LSS) and enagjes the WRITE ot E output,signat onto 
the WRITE/MSS bidirectional /O pin. When LSS Is tied HIGH, the 
WRITE/MSS pin becomes an input pin; tying the WRITE/MSs5 pin 
HIGH programs the slice to operate as an intermediate slice (IS) 
and tying it LOW programs the slice to operate as a most siynili- 
cant slice (MSS). This is shown in Figure 21. 
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EXPANDING THE NUMBER OF Am2003 REGISTERS 


The Am2903 contains 16 intemal working registers configured in 
a standard two port architecture. The number of working registers 
in the ALU configuration can be increased by utilizing the 
Am29705 16-word by 4-bit two-port RAM. Any number of 
Am29705's can be connected to the Am2903 to increase the 
number of working registers. Figure 22 shows a block diagram of 
the basic Am29705. As is seen, the device consists of a 16 word 
by 4 bit two port RAM with latches at the A and B outputs similar to 
the RAM contained within the Am2903. Each of the latch outputs 
has three state drivers capable of driving the DA and DB inputs of 
the Am2903. The Am29705 Is a non-inverting device. That is, 
data presented at the inputs is stored in the RAM and when 
brought to the RAM outputs, it is non-Inverted from when it was 
Orginalty brought into the device. 


The technique for using the Am29705 to expand the number of 
registers in the Am2903 can best be visualized by referring to 
Figures 23 and 24 simultaneously. In Figure 23, the data bus 
connections are shown such that the Am2903 Y output is used to 
drive the Am29705 inputs. Here, we also assume this bus may be 
tied to a data bus through a bi-directional buffer. in Figure 23, the 
A outputs of the Am29705 are connected together and also 
connected to the DA input of the Am2903. Likewise, the B outputs 
from the Am29705 are also shown connected to the DB inputs of 
the Am2903. In all cases, we are assuming 16-bit data busses. 
Thus, four Am2903's are assumed and eight Am29705's are 
assumed. As shown In Figure 23, one of the write enable inputs to 
the Am29705 is tied to the latch enable input of the Am29705 and 
these pins are also tied to the clock input of the Am2903. This 
allows the latches in the Am29705 to perform identically to those 
in the Am2903. 
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Figure 22. Am29705 Biock Diagram. 
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Nwe rete: to Figure 24, we see tne connec!-ons required to set up 
the aderessing for addittonal registers associated with the 
Am2303. Here, three two-line to four-line decoders are used to 
feoperly contro! the A address, B address and write enable sig- 
nals to the devices. As shown in Figure 24, the four A address 
lines are all tied in paratiel betwean the Am2903 and the 
Amz29705's. The two-line to four-iine decoder is used to enable 
f\9 appropnate output enzb'!e from the Am29705's or switch the 
EA MUX inside the Am2993 such that the proper register is 
Selected. The B address ope'ates in a similar fashion in that the 
four B acdress |.nes are also ali ted togother. Likewise, a two-line 
to four-line decoder is used lo property select the output enable of 
either ihe Am29705's a the Am2903 such that the correct source 


operand register is selected. In addition, a two-line to four-line 
decoder is used to control the write enable signal such that only 
one register Is written into as a destination. This is controlled by 
properly selecting the write enable of either the Am2903 or the 
Am29705 as determined by the two most significant bits of the B 
address. : — 


Wf this technique Is used properly, any number of Am29705's can 
be used in conjunction with the Am2903. It may be necessary to 
use either a three-line to eight-line decoder or perhaps even e@ 
larger circuit to decode the more significant bits of the A and B 
addresses. Likewise, the write enable signal must be controlled 
so that the correct destination register will be written. 


UNDERSTANDING BIT SLICE TIMING 


Perhaps one of the most important aspects of designing with 
either the Am2901A or the Am2903 is understanding the calcula- 
tions required to compute the worst case AC performance. In 
order to perform these calculations, we have selected a number 
of standard Schottky devices and assigned minimum, typical and 
maximum speeds at 25°C and 5V for use in these calculations as 
shown in Figure 25. Certainly the design engineer should use the 
exact specifications of the devices he has selected for his design 
in order to perform the worst case calculations. What is intended 
here is an understanding of the technique to perform these calcu- 
lations and some method to allow a comparison of the Am2901A 
and Am2903 in terms of their AC periormance. Since at the time 
of this writing the Am2903 is still being characterized. only the 
typical AC data is currently available. Thus, all calculations wil be 
made using the typical AC times such that we can compare the 
Am23901A with the Am2903. When final characterization data on 
the Am2903 's available, the designer can then compute hs 
performance by selecting the appropriate temperature range and 
power supply variations as required by his design. 


Figure 26 shows the typical AC calculations for the functions 
usually considered in an Am2901A design. These functions are 
usually the speed for a logic operation, arithmetic operation, logic 
Operation with shift and arithmetic operation with shift In each 
case, we are computing speeds tram the LOW-to-HIGH transition 
of a clock through an entire microcycie to the next LOW-to-HiGH 
transition of a clock. 


TWO ADDAEES OPERATION 


Figure 24. Am2903 — RAM Address Cascading. 
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Figure 25. Standard Device Schottky Specds. 


Similarly, Figure 27 shows.the same type of computations for an independent of the shift operation. This Is easily seen in Figure 
Am2903 system. Thera Is one very important distinction that 27. 

should be made in computing the timing of an AM2903 16-bit ALU . 
when compared with an Am2S01A ALU inthatin the Am2903, the By way of comparison, Figure 28 shows speeds for the ‘cu. typ.:s 
shifter is at the output of the ALU and is followed by the zero of operations for the Am2S01A 16-bit system as compared with 
detector. Thus, in an Am2903 design, the flags are no longer the Am2903 16-b’* system. 
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Figure 26. Typical AC Calculations for the Am2901A. 


Figure 26. (Cont.) 
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TWO'S COMPLEMENT ARITHMETIC OPERATION 
WITH SHIFT DOWN — 16 BIT 
SPEED COMPUTATIONS 
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Figure 27. (Cont.) 
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Figure 28. Summary of Am2901A and Am2903 AC 
Performance In a 16-Bit Configuration. 


USING THE Am2903 IN A 16-BIT DESIGN 


Perhaps the best technique for understanding the design of the 
16-bit ALU is to simply take an example. Figure 29 shows a block 
diagram overview of four Am2903's with the appropriate shift 
matrix control, status register, MAR and the usual interface to a 
CCU and main memory. This block diagram represents the nor- 
mal data handling path associated with a simple 16-bit minicom- 
puter. If we expand this block diagram to show what would nor- 
maily be considered to be the complete 16-bit central processing 
unit, the block diagram of Figure 30 results. Here, we see the 
Am2903's surrounded by a typical set of MSI support chips. In 
addition, the block diagram shows a typical computer control unit 
as described in Chapter 2 of this series. Thus, all of the blocks are 


PARALLEL 


vO PORT 


DATA IN 
MAIN 


now In place to show a simple 16-bit microcomputer built using 
the Am2900 family devices. The full design for such a machine is 
shown In Figure 31. 


Figures 31A, Figure 31B and Figure 31C detail the connection of 
each IC used in this design. Quite simply, the design can be 
described as follows. Figure 31A represents the microprogram 
sequencer portion of the design. U1, U2 and U3 are the instruc- 
tion register that receive a 16-bit instruction from main memory. 
U4, U5 and U6 are the mapping PROMs used to decode the OP 
code portion of the instruction to arrive at a starting address for 
the microprogram sequencer. The microprogram sequencer is 
the Am2910 and is shown as U7. The branch address pipeline 
register is U8, U9 and U10 and can be enabled to the D inputs of 
the Am2910 sequencer to provide the jump address from micro- 
code. The pipeline register for the instruction inputs to the 
Am2910 is U14. This machine also has the ability to select tha A 
and B addresses for the Am2903 devices from the microprogram 
as well as the instruction register and U11 and U12 provide this 
Capability as a part of the pipeline register. U13 is a two line to foar 
line decoder used as part of the control for the A and B address 
select for the Am2903's. U15 is part of the pipeline register and 
provides both true and complement outputs for bit 11. U16 and 
U17 represent a one of sixteen decoder whose output can be 
applied to the DA bus to allow the implementation of ail the bit 
operations. These include bit set, bit clear, bit toggle and bit test. 
U18 and U19 are PROM's that provide the ability to enter one of 
thirty-two preprogrammed constants onto the DA bus. . 


Figure 31B is predominately the data handling portion of the 
design. Here, U20 and U21 represent a data register that re- 
ceives data from the data bus. U26, U27, U28 and U29 are the 
four Am2903’'s that form a 16-bit register/ALU combination. U30 
is the carry look ahead generator for the ALU section. U22, U23 
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___ Figure 29. Am2903 with Shift Mux and Status Register. 
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and U24 represent the status register with the ability to save and 
festore the flags in main memory. U25 is the condition code 
multiplexer for the microprogram sequencer. U33, U34, U35 and 
U36 represent the shift linkage multiplexers that tie together the 
internal shifters within the Am2903's. U37 is part of the pipeline 
tegister and provides both true and complement outputs of a 
number of the microprogram bits. U38 is part of the carry in logic 
control such that double length arithmetic operations can be 
performed. U31 and U32 are the data out register that can be 
used to accept data from the Am2903s and enable this data onto 
the data bus. U39 and U40 represent the memory address regis- 
ter and are used to hold the address provided tom ine CPU to 
main memory. 


The microprogram store is shown in Figure 31C. Here, we have 
used both the 512 x 8 registered PROM’s and 512 x 4 non- 
registered PROM's in this design. A total of 68 microprogram bits 
have been depicted in this design. These are shown so that 
maximum flexibility is achieved. In most typical designs some 10 
to 20 of these bits would not be used. Figure 31C shows four 
512-word by 8-bit registered PROM's (U41, U42, U43 and U44). It 
+ also shows nine 512-word by 4-bit PROM’s represented as U45 
_ through US3. 

Perhaps the best way to review the design is to simply understand 
the function of each of the microprogram control bits. If the pur- 
pose of each of these bits is well understood, the design engineer 
will be well along in understanding the design of the simple 
min'computer CPU presented here. 


The ddictonreotarn Structure” 


The microprogram for the design shown in Figure 31 is 68 bits 
wide. The functions of the microprogram contro! bits are as fol- 
lows: 


Bits PLO The9d instruction bits of the Am2903 auper 
through PLS Ss slices. . 
Bits PLS, The TEN, EA, OEB control ‘pute of the 
PL10, PL11 Am2903 superslicas, respectively. PL11 Is also 
. connected to the data-in registers (U20 and 
U21) output-enable. This connection assures 
that there will be no conflict on the DB pins. -. 
Bits PL12 Select the source for SIO of the Am2903, both 


through PL14 for shitt-up and for shift-down operations. The 
(x12 through following table summarizes the functions of 
p14) these bits. 


Microprogram Bits . Slo, $10, 
14 13 12 (Shift-down) (Shift-up) 


0 0 
S109 SIO, 
Ql0, alo, 


cr 


Carry Carry 

Zero ‘Zero 

Sign : Sign 

Not allocated Not allocated 
1 1 


rrrire 


Bits PL15 Select the source for QIO of the Am2903, both 
through PL17 for shift-up and shift-down operations. The fol- 
(#15 through lowing table summarizes the functions of 
p17) . these bits. 


Microprogram Bits alo, aio, 
17 16 15 (Shift-down) (Shift-up) 


L L L (0) ie] 
L H SIO, S10, 
L QlO, alo, 
H Carry Carry 
L Zero Zero 
H Sign Sign 
L Not allocated Not allocated 
H 1 1 


Bit PL18 


When LOW, enables the MAR clock input, i.e. 
the data appearing on the Y output pins of 
the Am2903 Supersiices™ will be clocked into 
the MAR at the LOW-to-HIGH transition of 
the clock pulse. 


Bit PL19 When LOW, enables the MAR output onto the 


Memory Address Bus. 


When LOW, enables the data output register 
clock, i.e. the data appearing in the Y output 
pins of the Am2903 Superslices™ will be 
clocked into the data output registers (U31 
and U32) at the LOW-to-HIGH transition of 
the clock pulse. 


Bit PL21 .. When LOW, enables the data output registers 
onto the Data Bus. Pia 


When LOW, enables the data-in register clock, 
i.e. the data appearing in the Data-Bus will be 
clocked into the data-in registers at the 
LOW-to-HIGH transition of the clock pulse. 
This Is the Cl input of the Am2910 micropro- 
gram sequencer, 

Bits PL24 This is a 4-bit wide field which can be used 
through PL27 either for the A-address, for the B-address or 
for both A and B addresses of the Am2903 
superslices. 


Bit PL20 


Bit PL22 


Bit PL23 


Bits PL28 This is a 4-bit wide field, which can be 

through PL31_—_-used for either the A-address of the Ami2903 
superslice or to designate one of sixteen bits to 
the DA inputs of the Am2903 superslice via the 
Am2921's (416 and 417). 

Bits PL32 Select the source for the Am2903 A-address, 

and PL33 according to the table below: : 


rae A-Adcress Source 


Data Bus bits 0 through 3 
Microprogram bits 28 through 31 
Data Bus bits 4 through 7 
Microprogram bits 24 through 27 


Selects the source of the Am2903 8-address, 
according to the table below: 


i B-Addross Source 
Data Bus bits 4 through 7 
Microprogram bits 24 through 27 


Bit PL34 
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Bit PL3S Is the C,, input of the least significant Am2903 
via an Am74S157 mux (38). 

Bits PL36 Affect the status register input signals, ac- 

and PL37 cording to the table below: 


Part Halal re ce it aes 


Previous Carry Previous Zero, 
Sign, Overtlow 

Previous Zero, 
Sign, Overflow 

Am2903 superslices’ Output 

Data Bus bits 0 through 3 


Previous SlO15 


Bit PL38 Selects either the carry flip-flop or the PL35 bit 


for carry in. 


Bit PL39 When LOW, enables the status register output 


to the data bus bits 0 through 3. 


Controls the output polarity of the one-of-six- 
teen bit select logic. 


When LOW, enables the Instruction register 
(Ui, U2, U3) clock. The data present at bits 0 
through 15 of the Data-Bus will be latched into 
the Instruction register at the next LOW-to- 
HIGH transition of the clock pulse. 


Bit PL40 


Bit PL41 


Bit PL42 


the main memory that a memory read is re- 
quested. 


This is an output signal. When HIGH, it signals 
to the main memory that a memory write is re- 
quested. 


Selects the source of the one of sixteen bit de- 

coders (U16 and U17) . When LOW, the output 

of the Am2919 register (U12) containing the pre- 
viously latched microprogram bits 28 through 

31 will be applied to the decoders. When HIGH, 

the output of the Am2919 register (U3) con- 

taining the previously latched Data-Bus bils 0 

through 3 will be applied to the decoders. 


Selects the Am2903 Superslices™’ DA port 
source. When LOW, the output of the one of six- 
teen bit decoder (U16 and U17) will be applied 
to that port. When HIGH, the output of the 
Am29771 PROM's (U18 and U19) will be ap- 
plied to the Am2903 DA ports. 


These are the ALD and CCEN control inputs 
and PL47 of the Am2910 sequencer, respectively. 


Bits PL48 These select the condition code according to 
through PLSO the following table: 


Condition Code Selected 
48 


Carry 
Sign | 
Zero 

Overflow 


Bit PL43 


Bit PL44 


Bit PL45 


Bit PL46 


Bits 
50 49 


Not Allocated 


HBEIIecee 
DTrrerrertire 
mexexreier 


This is an output signal. When HIGH, it signals - 


. Outputs. Bit 9 (IEN) is LOW and theretore, W 


Bit PLS1 Is the condition code potarity control. When 
HIGH, the condition code selected will pass non- 
* inverted. When LOW, the selected condition 
code will be complemented. 
Bits PL52 Are the | inputs of the Am2910 sequencer. 
through PL55 
Bits PL56 This is a 12-bit wide field and it serves, usu- 


through PL67 ally as the next microprogram address.How- 
ever, the 5 least significant bits of this field (bits 
56-60) serve also as an address field of the 


Am29771 “constant” PROM’s (U18 and U19). 


Some Sample Microroutines 


Figure 32 shows the microprogram code for a few sample micro- 
routines. Different addressing schemes are demonstrated with 
the “ADD” operation. All the other arithmetic or logic operations 
can be easily programmed by substituting the |,-l4 field of the 
Am2903 with the appropriate function. Since the main memory 
address is generated by the Am2903 superstices, the intemal 
tegister No. 15 serves as the program counter. 


The following is a description of some sample microroutines. The 
reader should refer to the description of the microprogram bits_ 
given earlier in this chapter and to the data sheets of the Am2910 
sequencer and of the Am2903 superslice. 


Microword INIT. 


This microword should be at address 0 and when the machine is 
reset, the Am2910 will start executing from here. The purpose of 
this location is to reset the machine program counter (Register 
15) to zero. Ultimately more microinstructions can be added, 
should the necessity of other reset functions arise. 


Bits 1-4 (Am2903 I,-14) being 8, will cause the supersiices to 
generate all zeroes at the F-points (intemal). Bits 5-8 (Am2903 
Ig-1g) being F,; will cause this data (all zeroes) to appear on the Y 
will be LOW 
and this data will be written into the internal register selected by 
the B-address inputs. Bit 34 is HIGH; therefore, microprogram 
bits 24-27 will be selected as B address source. Since Fy is in 
these bits, all zeroes will be written into the program counter 
(Register 15). Bit 18 is LOW; therefore, the data at the Y outputs 
(all zeroes) wil be latched into the MAR at the next clock pulse. 
Bits 36 and 37 are set such that the flags will be updated, namely 
CY=N=OVF=0, Z=1. 


Bits 42, 43 are both LOWso nomemory reference signalis sentto 
the main memory (the MAR is still in an undetermined state). Bits 
52-55 (Am2910 I) are set to Ey which will force the sequencer to 
continue to the next sequential address (1) as the Cl (bit 23) is 
HIGH. 


Bits 21 and39 are both HIGH to ensure that there is no conflict on 
the data bus though in this case one of them could be a DONT- 
CARE. Bit 38 could also be a DON'T-CARE as the carry is zeroed 
by the ALU. Making a HIGH in bit 46 enables executing this 
microstep without disturbing the Am2910 sequencer’ intemal 
register which at power-up has no significance but may be impor- 
tant, should a software restart be issued. 


All the other bits are DON'T-CAREs. 
Microword FETCH’ - 


This Is the first step in the machine instruction fetch routine. In this 
step, the main memory is addressed by the MAR, a read signal is 
Issued (bit 42 = HIGH), and the machine instruction (mac- 


. “toinstruction) is“placed on the data bus by the memory. itis 


~~. 


i ec 


Number of Bits 


x 
AODIMM + 1 |FETCH +1 


ADD DIR 
ADO DIR +1 
ADD DIR + 2 |ADDIMM + 1 


g 
wo 
FETCH + 1 
af 
x 
x 


ooo 
—~aO]saow 


ADD RAI + 2 |f ETCH + 1 


S OE6 EA IEN Ic_g th-4 
.4 


o 


1 


Nonberofeis]t +24 «[4]a 3 a] 4 43 


3338 Slsila se 
$83 
[x 4X xX F| | xo] 


15-17 
12-14 
"1 


ADDIMM 
ADDIMM + 1 


ADD DIR 
ADD DIR +1 
ADD DIR + 2 


ADD RAI 
ADD RAI +1 
ADD RAI + 2 


KK | KKK | OK OK | KL OK XIX 
KK | KOK KE | OK OK | OK | Oe <I ><: 
~xXxX/xX=-O;/“~aO;lo0/lo o|x 


1. 4-bit fields in hex, others in octal. 
2. X = Don't Care. 


Figure 32. Example Microcode for Figure 31 Design. 
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latched into the instruction register (U1, U2, and U3) at the next 
clock LOW-to-HIGH transition (bit 41 = LOW). Itis assumed that 
if a relatively slow main memory is used, the clock Is halted until 
the data is stable on the data bus and the register set up times are 
met. We will see in a later chapter how easy itis to implement this 


* requirement using the Am2925 clock generator. The same as- 


sumption will also be made in a memory write cycle. 


Bit 9 (Am2903 TEN) is HIGH; thus, we don't care what the ALU 
does during this microstep. We prevent the flags from changing 
by setting bits 36-38 LOW. Also, the registers atthe Y output have 
the E input HIGH (bits 18, 20). Bits 21 and 39 are both HIGH; thus, 
the data bus is free to accept data from the main memory (bit 42 is 
HIGH, signaling memory read request). The MAR is enabled to 
the address bus (bit 19 = LOW) and at the next clock, the 
macroinstruction will be latched into the instruction registers (bit 
41 = LOW). The Am2910 sequencer will continue to the next 
instruction (bits 52-55 = Ey). 


Microword FETCH + 1 


This is the second step in the macroinstruction fetch routine. The 
instruction already resides in the instruction registers U1, U2 and 
U3). 

The Am2910 sequencer receives a JUMP MAP instruction (bits 
52 though 55 = 2). The next microinstruction will begin to execute 
the present macroinstruction — according to the mapping PROM. 


We use this microstep to update (increment) the program counter 
(Register 15). Bit 34 being HIGH, microprogram bits 24-27 (=F) 
will be the B address. The Am2903 OEB and Ip are LOW, there- 
fore, the contents of Register 15 will serve as the S operand for 
the ALU. C,, being HIGH, a 4 in the |,-1, field will increment this 
value. IEN = LOW with Is-l, = F will write this (incremented) value 
into the same register (R15). At the same time, the MAR Is also 
updated (bit 18 = LOW). 


We could update the program counter and the MAR in the previ- © 


ous microstep (location FETCH), but then we had to leave the 
ALU idle during this microcycle. By adopting the present scheme, 
we can overlap the first step of the macroinstruction fetch routine 
(the memory-read cycle) with the execution of the last step of the 
Previous macroinstruction ~ provided the memory and the data 


bus are free to perform it. The JUMP MAP cycle is always neces- 


Sary — and thal is why we prefer to update the PC at this step. 
Microword ADD , 


This is a sample register-to-register operation. The two operands 
reside in the internal registers pointed to by the two 4-bit fields of 
the macroinstruction: 


15 - 87 
1st Operand and 


Destination Register 
Number 


2nd Operand 


OPCODE Register Number 


Bits 32-33 are set LOW, instruction register bits 0-3 are selected 
as A addréss. Bit 34 = LOW selects instruction register bits 4-7 as 
B address (see Fig. above). Bit 1 (Ip), bit 10 (EA) and bit 11 (OEB) 
are also LOW; therefore, the contents of the selected registers 


. willbe presented to the ALU's R and S inputs. Bits 1-4 (I-14) = 3, 


the ALU will perform: 
F = R plus S plus C,. 


Note that bit 35 and 38 are LOW. With Is-Ig (bits 5-8) = F,, andlEN | 


(bit 0) = LOW, the result will be written into the internal register 
pointed at by the B address lines. 


43 0 


Bits 18 and 20 are HIGH and inhibit the MAR and the data out 
registers trom being affected, while bits 36, 37 (=2) allow the 
flags to assume values according to the result of the operation. 


During the execution of the function required (AOD in this exam 


-ple) we fetch the next OP CODE from the main memory. The 


MAR is onabied to the address bus (bit 19 = LOW) and amemory 
read is requested (bit 42 = HIGH). At the end of this microstep the 
next macroinstruction will be latched into the instruction registers 
(bit 41 = LOW). 


The Am2910 sequencer Is instructed to select the pipeline regis- 
ter bits 56-67 as the next microprogram address (bits 52-57 = 7, 
bit 47 = HIGH) where the location of FETCH + 1 (2 in this 
example) is written. The next step will be JUMP MAP and update 
PC. 


Microword ADD IMMEDIATE 


This 2 step microroutine adds the contents of an internal register, 
pointed at by bits 0-3 of the macroinstruction with its second word, 
placing the result into the Internal register pointed at by bits 4-7 of 
the OPCODE. 


15 87 43 0 


Result 


2nd Operand 


OFCORE Register Number 


Register Number 


First word of the macroinstruction 


15 ; 0 


DATA (1st Operand) 


Second (next consecutive) word of the macroinstruction 


The first step is to read the ficst operand from the memory (bit 19 
= LOW, bit 42 = HIGH) and to latch it into the data-in register 
(U20 ard U21) (bit 22 = LOW). At the same time the ALU updates 
(increments) the program counter (register 15) and the MAR (bit 
18 = LOW). (Compare the location FETCH + 1). The Am2910 
sequencer will continue to the next microprogram address (com- 
Pare to location FETCH). 

Location ADDIMM + 1 Is the second step of this macroinstruc- 
tion. It is very similar to location ADD, the only difference Is that bit 
11 (OEB) is HIGH, selecting the Data-in register as source for the 
ALU's S operand. The same macroinstruction fetch overiap 
technique is used again. 


Microword ADD DiRect . - 


This Is the starting location to execute a macroinstruction where 
the second word is the address of the operand: 


15 87 43 0 


2nd Operand 
Register Number 


Result 


Register Number 


First word of the macroinstruotion 


Address of the 1st operand 


Second (next consecutive) word of the macroinstruction 


The first step is to read the second word of the macroinstruction 
into the Data-in register. This microword is identical to the one 
wnitten at jocation ADDIMM. 


Microword ADD QIR + 1 


The Data-in register now contains the address of the operand. 
We have to transfer it into the MAR. 


With tp (bit 0) LOW and OEB (bit 11) HIGH, the ALU's operand will 
be the DB bus, i.e., the Data-in ragister. I-14 (bits 1-4) = 4 will 
Pass this input to its output, as C,, (bit 3) is LOW. With (bIt9) = 
HIGH, the WRITE line will be HIGH too, assuring that the internal 
registers maintain their contents. Since Is-lg (bits 5-8) = Fy, the 
ALU output will appear on the Am2903 Y pins. This data which is 
actually the operand address and will be transferred into the MAR 
at the next clock cycle. The Am2910 sequencer continues to the 
Next consecutive microstep. 


Microword ADD DIR + 2 


Now we read in the operand from the main memory. The MAR is 
enabled to address bus (bit 19 = LOW), a memory read signal is 
issued (bit 42 = HIGH) and the data-in registers clock is enabled 
(bit 22.= LOW). At the next LOW-to-HIGH transition of the clock, 
the operand wil! be placed in the data-in register. 


Meanwhile, we need to restore the address of the next mac- 
roinstruction in the MAR. Bits 32-33 = 3 select microprogram bits 
24-27 as the A address (an Fy Is written there): therefore, the 
internal program counter will be addressed, as EA (bit 10) = 
LOW. The ALU performs an F = A + C,, with C, (bit 35) LOW, 
thus passing the program counter contents to the output. JEN (bit 
9) = HIGH prevents disturbance of intemal Am2903 registers and 
bit 18 will enable the MAR to receive the next macroinstruction 
address. 


Note that the situation now Is exactly the same as after the first 
step of ADD iMMediate. The operand is In the data register and 
the MAR points to the next macroinstruction. Therefore, the 
Am2910 sequencer will address, as the next microstep, location 
ADODIMM +1. The step atter this will, of course, be FETCH + 1.A 
total of 5 microsteps were needed to execute this macroinstruc- 
tion but it occupies only 3 microprogram locations. 


kt is worthwhile to note here that by adding two more Am2920 
registers between the Data-bus and the Address-bus and a 
couple of control-bits in the microprogram, we could shorten the 
microprogram by one step. In this design we chose not to do so In 
order to demanstrate the Data-bus to Address-bus path through 
the ALU. ‘ 


Microword ADD RAI 


The macroinstruction to be excuted here points to the register in 
which the first operand is written, and also into which the result 
should be written. The second 4-bit field of the OP-CODE (bits 
0-3) points to the register in which the address of the second 
operand is stored. 


15 87 43 0 


1st Operand and 


2nd Operand’s 
Address Register 
Number 


OPCODE « | Result Register 


Number 


Bits 32 and 33 are LOW. Therefore, instruction register bits 0-3 
will form the A-address. Now we take the contents of this register 
and piace it in the MAR exactly the same way as we did in location 
AOD DIR + 2 with the program counter. The Am2910 continues. 


Microword AOD RR1 + 1 


Here we fetch the operand and place it in the Data-in register. At 
the same time, we restore the program counter into the MAR. 


Microword ADD RAI + 2 


Bits 32, 339 = 2 and instruction register bits 4-7 serve as the 
A-address. Bit34 = LOW; the same instruction register bits serve 
as B-address, too. Note, that OEB (bit 11) is HIGH: therefore, the 
ALU R-source will be the Data-in register and the S-source will be 
the register addressed by A-address. The result (sum), however, 
will.be written to the correct register, as TEN (bit 9) is LOW. 


At the same time, the next macroinstruction is fetched in the 
usuall oooverlapping way and the next microinstruction to be 
excuted will be at location FETCH + 1. 


Summary 


In this design shown in Figure 31, we have demonstrated some of 
the addressing schemes mentioned in Chapter 1. We used the 
ADD instruction throughout these examples, but any other arith- 
metic or logic instruction can be executed, In exactly the same 
manner by changing the microcode bits 1-4 to the appropriate 
ALU code. 


The reader Is encouraged to write several microcode-lines to 
execute the other addressing modes mentioned in Chapter 1. He 
will discover that when the result of the macroinstruction is to be 
written into main memory, the overlapping instruction-fetch is not 
feasible. in some cases, when the MAR no longer contains the 
Program Counter value, an additional microstep is needed in 
order to restore the Program Counter into the MAR. The reader is 
again encouraged to modify location FETCH in order to save this 
additional microstep. 


Appendix 


Throughout Chapter 3, a number of AC calculations have been 
made to show typical speeds for an AM2901A and Am2903 16-bit 
ALU configuration. This Appendix shows the latest SWITCHING 
CHARACTERISTICS for the Am2901A and Am2903. 


The typical data on the Am2901A shown In this Appendix super- 
sedes that shown on page 2-12 of the Am2900 Family Data Book 
dated 4-78 (AM-PUBO003). The only difference between the data 
shown in the typical column of the switching characteristic and 
this Appendix appears in Table 3. The typical carry in set-up time 
should be 40ns.’ 


The typical switching characteristic data for the Am2903 as 
shown In this Appendix supersedes the data presented in the 
Am2903 Bipolar Microprocessor Slice/Am2910 Microprogram 
Controller Data Booklet dated 3-78. Here, a number changes 
have been made to the table for both the combinatorial propaga- 
tion delays and the set-up and hold times. 


Should any questions arise concerning the switching characteris- 
tics for either the Am2901A or Am2903, please do not hesitate to 
contact the AMD factory and ask for Bipolar PCO Pe seo 
Marketing or Bipolar Microprocessor Applications. 
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Am2901A — (MAY 18, 1978) 


ROOM TEMPERATURE 

SWITCHING CHARACTERISTICS 

(See next page for AC Characteristics over operating range.) 
Tables |, Il, and Ill below define the timing characteristics of 
the Am2901A at 25°C. The tables are divided into three types 
of parameters; clock characteristics, combinational delays 
from inputs to outputs, and set-up and hold time require- 
ments. The latler table detines the time prior to the endof the 
cycie (1.¢., clock LOW-to-HIGH transition) that each input must 
be stable to guarantee thal the correct data is wntten into one 
of the intemal registers. 


All values are at 25°C and 5.0V. Measurements are made at 
1,.5V with Viv = OV and Vi, = 3.0V. For three-state disable 
tests, C, = S.0pF and measurement ts to 0.5V change on 
output voltage level. All outputs fully loaded. 


TABLE | 
CYCLE TIME AND CLOCK CHARACTERISTICS 


Read-Modity-Write Cycle pe 


{ume from selection of 
A, B registers to end of 
cycle) 


Maximum Clock Frequency to 
Shift O Register (50% duty 
cycle) : 
Minimum Clock LOW Time | -30ns_— 


Minimum Clock HIGH Time 
Minimum Clock Period 


TABLE I 
COMBINATIONAL PROPAGATION DELAYS (all in ns, CL = SOpF {except output disable tests)) 


| 
ae «| loafer 


RAMg| 
RAM3' 
| 50 | 60 { - [75 | 75 | 70 | 59 | 


D (arithmetic mode} 


F=0 
RL? 
270 


| 30 | 40 | - ff 39 f 37 fai [31 | 


| 40 | ~ || 36 [34] - | - | 


B Dest. 2a 
D {arithmetic mode) 


O(I = X37) (Note 5} 


RAMo, 3. O90, 3 


Notes 1. See next page. 


' . 

2. If the B address 1< used a8 @ source Operand, allow for the “A, B source” set-up time, if it Is used only for the destination address, use the 
“B dest.” set up time 7 

3. Where two numbers are shown, both must be mat. 

4 “tow 1s tha clock LOW tme 

5. OV 018 the fastest way to load the RAM from the D inputs. This function is obteined with | » 337. * 

6 Using O register as source operand in ansthmetic mode Clock 1s not normally in critical speed path when Q is not a source. 


A. Am2903 SWITCHING CHARACTERISTICS (TYPICAL ROOM TEMPERATURE PERFORMANCE) — (MAY 18, 1978) . 


_ Tables IA, IIA, and IIIA define the nominal timing characteris- 
ics of the Am2903 at 25°C and 5.0V. The Tables divide the 
parameters into three types: pulse characteristics for the 
clock and write enable, combinationa! delays from input to 
oOulpul, and set-up and hold times relative to the clock and 
write pulse. , ; 


Measurements are made at 1.5V with Vy = OV and Viq = 
3.0V. For three-state disable tests, C, = 5.0pF and mea- 
surement is to 0.5V change on output voltage level. 


TABLE IA — Write Pulse 


Minimum Clock HIGH Time 


TABLE IIA — Combinational Propagation Delays (Alt In ns) 
Outputs Fully Loaded. CL = SOpF (except output disable tests) 


DA, DB Inputs 


Clock 58 57 


] 


Notes: 1. Applies only when leaving special functions. 


b. pe 
OEB EnapierOisabie | - | - [| - | - | - | - [saj - | 
OEY EnabieiDisabie[vavaial - [ - [| - [- [ - | - | - | - [- | = | 
Bt} SSH 
ae ae 


—~__To Output StO9 
From Input ——~__ Casa QlOo, AIO; (Parity) 
A, B Addresses 87 
(Arith. Moda) 
A, B Addresses 56 
(Logic Mode) 


Minimum Time CP and WE both LOW 
to write 


Minimum Clock LOW Time 


and Clock Characteristics 


oe 


2. Enable/Disable. Enable is defined as output active and correct. Disable is a three-state output tuming off. 
3. For delay from any input to Z, use input to Y plus Y to Z. 

TABLE IIIA — Set-Up and Hold Times (Al! in ns) 

CAUTION: READ NOTES TO TABLE II. NA = Not Applicable; no timing constraint. 


With Respect to 
to this Signal 


WE HIGH 
WE LOW 
A.B as Sources 


B as a Destination 


AIO, AIO; 


TEN LOW 


HIGH-to-LOW LOW-to-HIGH 


| -3 | To store Y in RAM or Q 
| 0 | To Prevent Writing 


To Write into RAM 


Twa | Seonoiea 


[roe | woes | -3 
Ce ee 
= 
= 


-5 


To Write Data only into 
the Correct B Address 
To Shitt Q 


To Prevent Wniing into Q 
To Write into O 


seg Se 


Chapter IV 
The Data Path — Part Il 


owe ern . 
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CHAPTER IV 
THE DATA PATH 


The previous CPU example (See Chapter Ill) utilized SSland MSI 
components to accomplish the shift-linkage, carry control, and 
status register functions associated with the ALU. These func- 
tions can all be implemented with the Am2904 status and shift 
control unit. 


The Am2904 is an LSI device that contains all the logic necessary 
to perform the shift and status control operations associated with 
the ALU portion of a microcomputer. These operations include 
storage for ALU status flags; carry-in generation and selection; 
Gata-path, carry bit linkage for shift/rotate instructions; and status 
condition code generation and selection. The ALU status flags: 
carry, Zero, negative, and overflow; may be stored in either of two 
registers, a machine status register or a micro status register. The 
Carry-in multiplexer can select the true or complement of the 
microstatus Carry flag or machine status carry flag, as well as an 
external carry, a logical one, or a logical zero. The shift linkage 
multiplexers provide paths to rotate/shift single and double length 
words up, down, around the carry flag, and through the carry flag. 
The status condition code multiplexer provides tests on the true or 
complement of any status flag, as well as more complicated 
logical combinations of these flags to facilitate magnitude com- 
Parisons on unsigned and two's complement numbers, and nor- 
malization operations. 


STATUS REGISTERS 


The status registers contained in the Am2904 are shown in the 
upper portion of Figure 1. Each register is independently con- 
trolled by a combination of instruction signals and enable signals. 


MICRO STATUS REGISTER (SR) 


The »SRis enabled when the CEx signalis low. When CEz is low 
the instruction present on Is through Ip will be executed on the 
LOW to HIGH transition of the Clock input. These instructions fall 
into three main categories: Bit Operations, Register Operations 
and Load Operations. 


The bit operations allow individual bits of the %SR to be set or 
reset. (See Table 1.1). 


The register operations allow the 4SR to be loaded from the 
machine status register, to be set to ail one’s, reset to all zero's, or 
swapped with the machine status register. (See Table 1.2). 


The load operaticns allow the 4SR to be loaded from the | inputs 
directly, from the | inputs with ic complemented, or from the | 
inputs with overtiow retained, loyv—a + Hova — Hova (See Table 
1.3). The load operation with !¢ complemented can be used to 
emulate machines which use direct subtraction and thus need to 
complement the carry to obtain a borrow. The load with overtlow 
retained allows a series of arithmetic instructions to be executed 
without the need for a check for overflow after each instruction. If 
an overflow occurred at any time during the series it will be 
“trapped.” Thus a single test for overtiow, at the end of the series, 
is all that is required. 


MACHINE STATUS REGISTER (MSR) 


The MSR is enabled when CEy, is low. If CEyy is low the in- 
struction present on Is through Ip will be executed on the LOW 
to HIGH transition of the Clock input, Additicnally the indwidual 
bits of the MSR may be selectively enabled through the use of 
the Enable inputs Z , Ec. Ey and Enya (See Figure 1). This 
allows all possible combinations of the four status flags to be 
selectively operated on for maximum flexibility. Thus the in- 
struction specilicd by Is-lq only eflect the enabled status flags. 


‘ova oO 


thy Mic My Be 


Figure 1. Am2904 Block Diagram. 


The MSR instructions fall into two main categories: register op- 
erations and load operations (bit operations can be implemented 
through the use of the selective enable control lines). 


The register operations allow the MSR to be loaded from the 
bi-directional Y port, or the nSR. Additionally the MSR may be 
set, reset, or complemented (See Table 2.1). These three in- 
structions, combined with the selective enables. allow any com- 
bination of MSR bits to be set, reset, or compiemented. 


The load operations allow the MSR to be loaded directly from the | 
inputs, from the | inputs with I; complemented, orfrom the | inputs 
for shift through overflow (See Table 2.2). The load with Ic com- 
plemented can be used to produce a borrow. The load for shilt 
through overflow loads the zero flag and the negative flag from 
the | inputs while swapping the overflow and carry flags. This 
allows the shift through overtlow operation to be easily im- 
plemented. 


SHIFT LINKAGE MULTIPLEXERS 


The shitt linkage multiolexers control bi-dire stional sh: tines 
S$lOn, StQg (RAM shifter on the Am2903) and Q!On, Q Op (Q 
register shilter on the Am2903). To enable the shift !nk2c 2 rau 
uplexers the shi't enabie line SE must be low. When SE is 'ow the 
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TABLE 1. MICRO STATUS REGISTER 


INSTRUCTION CODES. 


Tabie 1-1. Bit Operations. 
uSR 


Comments 
Operation 


RESET ZERO BIT 

SET ZERO BIT 

RESET CARRY BIT 
SET CARRY BIT 
RESET SIGN BIT 

SET SIGN BIT 

RESET OVERFLOW BIT 
SET OVERFLOW BIT 


Qo 
1 
0 
1 
oO 
1 
oO 
1 


Table 1-2. Register Operations. 


at | cnmen | _ comme 


LOAD MSR TO »SR 
SET »SR 
REGISTER SWAP 
RESET »SR 


Table 1-3. Load Operations. 


uSR i 
Operation 


Iz > wz 
Ic > uc 
In > oN 
lova + Hovr -> HovA 
lz > uz 
le > pe . 
In > BN 

lova > Hovr 
> uz 
le > uc 

In > BN 

lova > HovaA 


LOAD WITH 
OVERFLOW RETAIN 


LOAD WITH 
CARRY INVERT 


LOAD DIRECTLY 
FROM 


$2. Ie. Ins lovi 


Note: The above tables assume CE is LOW. 


shift linkage data path will be set-up depending on the state of _ 


instruction lines ty» through I, (See Table 3). These instructions 
allow single length or double length shifts/rotates either up, or 
down. Additionally shifts/rotates may be done through or around 
the MSR carry and negative flag. Special operations exist to 
provide support for add and shift (multiply) instructions. These 
instructions select the present carry Ic (for unsigned rnultiply), 
or the Exctusive-OR of the sign flag |, with the overflow flag 
love (for two's complement multiplication). 


CONDITION CODE MULTIPLEXER 


The condition code muttiplier selects one of sixteen possible 
logical combinations of the x1SR, MSR or | inputs, depending on 
the state of the {<-Ip input lines. These combinations include the 
true or complement form of any individual bit in the 7SR, MSR or! 
inputs. Additionally several more complicated logical operations 


may be performed to provide magnitude tests on both two's © 


complement numbers and unsigned numbers. Tabie 5 lists the 
conditional test outputs (CT) corresponding to the state of the Is-lg 
instruction lines. Table 6 lists the possible relations between two 
unsigned or two's complement numbers and the corresponding 
Status and instruction codes. The three-state ‘conditional test 
output CT Is active only if OE gr is low. 


CARRY IN MULTIPLEXER 


The Carry output can be selected from one of seven different 
sources depending on the state of instruction input lines. The 
seven possible sources are: logical zero, Iggical one, the 4SA 
carry flag, the complement of the n.SR cary lag, the MSR carry 
flag, the complement of the MSR camry flag, or the extemal cary 
input Cy (See Table 4). 


* TABLE 2. MACHINE STATUS REGISTER 
INSTRUCTION CODES. 


Table 2-1. Register Operations. 


| oman |__ommeme_| 


LOAD Yz. Yc. Yn. Yova 
TO MSR 

SET MSR 

REGISTER SWAP 
RESET MSR 


INVERT MSR 


Table 2-2. Load Operations. 


MSR 
Operation 


LOAD FOR SHIFT 
THROUGH OVERFLOW 
OPERATION 


Iz > Mz 
lo > Mc s 
In > Mn 
lova — Movr 
lz > Mz 
I¢ > Mc 

In > My 
lova > Mova 


LOAD WITH 
CARRY INVERT 


LOAD DIRECTLY 
FROM lz, I¢ 
In. lover 


Note: 1. The above tables assume CEyy, Ez, Ec. En. Eovn area LOW. 


Y INPUT/OUTPUT LINES 


The bi-directional Y data lines may be used for extra data input 
lines when the Y output buffer is disabled (OEy high). 
Additionally, when Is-lg are jow, the Y buffer is disabled, irre- 
spective of the OFy signal. When the Y buffer is enabled (OEy 
is low) the Y data lines are selected from the MSR, uSR, or | 
input lines depending on the state of instruction lines Is and I, 
(See Table 7). 


TABLE 3. SHIFT LINKAGE MULTIPLEXER INSTRUCTION CODES. 


S10, | SiO, |al0, | alo, 


MSB LSB MSB LSB 


0-1 
ee 
se 
0 +=} 
O—=}—1 + 
O.-=}—+}+ 


Ql0, 
In © lova 
aio, 


MSB LSB MSB LSB 


-=}: 
O—?}: 5+: 
O -—=}- ++: 


N N N N N N N N N N N N N N N WN 


Notes: 1. Z = High impedance (outputs off) state. _ 3. Loading of Mg from I19.g overrides control from Isp. CEm,. Ec- 
2. Outputs enabled and Mc loaded only if SE is LOW. 
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TABLE 4. CARRY-IN CONTROL MULTIPLEXER INSTRUCTION CODES. 


(an @xova) + uz Un Ouxover) + uz (My @ Moya) + Mz 
tn@rovn~)* Az (un@xova) az (Mn@ Moya): Mz 
uN@HovA unN@xovA 

unOuovr uQ¥ova 

HZ uZ 

#z Bz 


~~ et wt we we ww OOnadacadadnnd 


t) 
1 
2 
3 
4 
5 
6 
7 
8 
9 . 
A 
8 
c 
oO 
E 
F 


Notes: 1. © Represents EXCLUSIVE-OR © Represents EXCLUSIVE-NOR or coincidence. 


For Unsigned Numbers 


(In @ lova) + tz 
Un® loved: Iz 
In© lova. 


INO lova 


© = Excuse OR H- HIGH Note For Am2910, the CC input is active LOW, so use 13.9 code to produce 
Q= 


Exclusive NOR L = LOW CT = L for the desired test. 
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TABLE 7. Y OUTPUT INSTRUCTION CODES. TABLE &-1. STANDARD DEVICE SCHOTTKY SPEEDS. 


Device and Path 


S-REGISTER 
Clock to Output 
GE to Output 


Set-up . 


Am2902A 

Cn to Cn+x, Y, Z 
G, P to G, P 

G, P to Cn+x, Y, Z 


Notes: 1. For the conditions: 
Is, le. Ig, ta, 14, Ip are LOW, Y Is an input. 
OEy is “Don't Care” for this conditon. 
2. X 1s “Don't Care” conditon. . 


we wwe ee 


TABLE 8-2. 
PRELIMINARY SWITCHING CHARACTERISTICS. 


Combinational Delays (ns) 


TIMING ANALYSIS 


In the previous chapter a timing analysis was presented with the 
shift-linkage, carry-control, and status registers implemented in 
SSI and MSI. This timing analysis will be repeated with the SSI 
and MSI logic replaced with the Am2904. Tables 8.1, 8.2, 8.4 and 
8.5 list the typical AC charactenstics of the registers, Am2902A, 
Am2901A, Am2903, and Am2904 used in these calculations. 
Table 8.3 lists the assumed AC charactenistics for the set-up time 
of the Am2904. 


Figure 2 illustrates the timing analysis for an Am2901A based 
design. The analysis begins with the LOW to HIGH transition of 
the system clock. All signals must be valid for the next LOW to 
HIGH transition of the system clock, i.e. one-microcycie later. 


Figure 3 illustrates a similar timing analysis for the Am2903. The 
results of both analysis are listed in Table 9. 


Ic. ty. lova 
USING THE Am2904 IN A 16-BIT DESIGN S10, O10 
n- a 
Perhaps the best technique for understanding the Am2904 is to 
simply compare 16-bit ALU designs with and without the 
‘Am2904. The first-design, Figure 4a, is an example of a ‘6-bit 
CPU design using SSI/MSI parts instead of the Am2904. In 
Figure 4b, the second 16-bit CPU design, the Am2904 is shown SI0,. SIO, 
replacing the SSI/MS!. The Am2904 substitutes for the appro- QIN, AiO, ' 
pnate shift matnx control and status registers. A more detailed : 
comparison may be obtained by referring to the 16-bit ALU de- 
signs in Chapter III and the one in Appendix C of this chapter. To 
understand the Am2904 further, the usage of the Am2904 [s 


described through the microprogram bits in the microprogram IOVA, IZ. INIC 
Structure and shown later in the actual microprograms. jaune 


$105. SIOp 
Q10,. QIO”_, 


The actual set-up times where not available at the time this was written, 
See current data sheets for correct uming on these signals. 
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(MAY 18 . TABLE 8-4, 
Am2S01A — 18, 1978) 
TABLE | 
ROOM TEMPERATURE 
SWITCHING CHARACTERISTICS CYCLE TIME ANO CLOCK CHARACTERISTICS 


Tables |, Il, and HI below define the timing characteristics of 
the Am2901A at 25°C. The tables are divided into three types 
of parameters; clock characteristics, combinational delays 
from inputs to outputs, and set-up and hold time require- 
ments. The latter table defines the time prior to ihe endol the 
cycle (i.e., clock LOW-to-HIGH transition) that each input must 
be stable to guarantee that the correct data is written into one 
of the intemal registers. 


TIME TYPICAL |GUARANTEED 


Read-Modify-Write Cycle 
(time from selection of 
A. B registers to end of 
evcle). 


Maximum Clock Frequency to 
Shift Q Regiszer (50% duty 
cycle) 


Minimum Clock LOW Time 
Minimum Clock HIGH Time 


Minimum Clock Period 


All values are at 25°C and 5.0V. Measurements are made at 
1.5V with Vj, = OV and Vy = 3.0V. For three-state disable 
tests, C, = 5.0pF and measurement is to 0.5V change on 
output voltage level. All outputs fully loaded. 


r TABLE Il 
é COMBINATIONAL PROPAGATION DELAYS {all In ns, Ci = 50pF (except output disable tests)) 
\° TYPICAL 25°C, 5.0V 
\ 
A bypassing 
ALU (I = 2xx) 
_¥ (Note 6) 
TABLE tlt 
SET-UP AND HOLD TIMES (all in ns) (Note 1) 
TYPICAL 25°C, 5.0V GUARANTEED 25°C, 5.0V 
t 


B Dest Spul +15 
70 


D {arithmetic mode) 
Dil = X37) (Note 5) 


A,B 40 93 
sone | 3's | tots | ee ee 
pede 2a eae SE 
La coer 
aes! 
ieee 


Noss: 1. See next page. F 
2. if the B addrass is used as @ source operand, allow for the “A, 8 source” set-up time; If it la used only for the destination address, use th: 
"B dest.’ set-up time. 
. Where two rumbcra are shawn, both must be met. 
- tpwk” ta the clock LOW time, 
- OVO is the fastest way to load the RAM from the D Inputs. This function Ja obtained with | = 337. 
. Using O register as source operand in arithmetic mode. Clock is not normally in critical speed path when Q is not a source 


TABLE 6-5. 


' A. Am2903 SWITCHING CHARACTERISTICS (TYPICAL ROOM TEMPERATURE PERFORMANCE) - (MAY 18, 1978) 


Tables IA, HA, and IIIA define the nominal timing characteris- 
tics of the Am2903 at 25°C and 5.0V. The Tables divide the 
clock and write enable, combinational delays from input to 
output, and set-up and hold times relate to the clock and 
write pulse. 

1 Measurements are made at 1.5V with Vy = OV and Viy = 
3.0V. For three-state disable tests, C, = 5.0pF and mea- 
surement is to 0.5V change on output voltage level. 


TABLE IA — Write Pulse and Clock Characteristics 


Minimum Time CP and WE both LOW 
to write 


Minimum Clock LOW Time 
Minimum Clock HIGH Time | 35ns_— 


Time 
i Wi 


TABLE NA — Combinational Propagation Delays (All In ns) 
Outputs Fully Loaded. CL = S0pF (except output disable tests) 


DA, OB Inputs 


Notes 1. Applies only when leaving special functions. 


To Output SI09 
From Input —-—~__ Ql0g, Q!05 (Parity) 
A, B Addresses 
(Anth. Mode) 
A, B Addrasses 
(Logic Mode) 


pee | 
ae Re oe ee ee ee ee 


TE) 
ae 


2. Enable/Disable. Enable is defined as output active and correct. Disable is a three-state output tuming off. 


3. For delay from any input to Z, use input to Y plus Y to 2. 


TABLE II[A — Set-Up and Hold Times (All in ns) 
CAUTION: READ NOTES TO TABLE Ill. NA = Note Applicable; na timing constraint. 


With Respect to 
to this Signal . 


ae a (eel 
re 
cece Sid 
cece 


Clock and WE both LOW 


HIGH-to-LOW 


PNA {| 9 | -3 | 
| Note2 | Note2 | 0 | 
we ee 
| -3 [| NA | Na | 
To Write Data only into 
jp -+ [mawe | rows | -3 | Reeoncteteatee | 
re 
| 2. [Notes | Notes | -18 | 
oe tert 
[| NA | | fs | 


To store Y in RAM or QO 
To Prevent Writing 
To Write into RAM 


To Shift Q 
To Prevent Writing into Q 


aa ries * 
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DATA 
REGISTER 


DA,0B A, 84) 


[ramp Ser 


DATA OUT 4 
REGISTER 


PATH 1. ——————_—_—_—__- 


PATH2 ---- —- ——-—- 
PATHS) —— -—— — —— 


Figure 2-1. 


ARITHMETIC OPERATION 
SPEED COMPUTATIONS 


| DEVICE KO. DEVICE PATH PATH PATH 2 PATH) | OATYA OUT J 
cPba ? REGISTER 

40 

s 


4 


| 
| 
| 


Figure 2-2. 


LOGIC OPERATION WITH SHIFT 
SPEEO COMPUTATIONS 


DATA OUT r 
REGISTER 


Figure 2-3. 


INSTRUCTIONS 


TWO'S COMPLEMENT 
ARITHMENC OPERATION 
WITH SHIFT DOW/N SPEEDO COMPUTATIONS 


SREG cPwmo 


2A AD 2 GF DATA OUT q 
20028 GP to Creny? REGISTER 
POA Cow Ff) OVA { 

Tas IN, IOVA B SOA i a 

Z001A SCT UP RAM) t is 

Z20IA Cawy puire s - 

ZwOIA Cn Zero | - 

2906 SETUPE - 

HAG i SeT-uP OD L 


TOTAL. oul wa 


Figure 2-4. 
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MAGNITUOCE ONLY ARITHMETIC OPERATION 
WITH SFT DOWN SPEED COMPUTATIONS 


PATH 1 
PATH 2) — 


Figure 2-5. 


jp ae ana cae eee | 


Hf} _______ | 


DATA OUT 4 
REGISTER 


Figure 3-1, 
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& 


TTT 


Rt = —— 


Figure 3-2. 


it —_—_————— 


‘PATH 2 -——— —-—-— 
PATH3) ~~ —- — —— 


Figure 3-3. 
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ARITHMETIC OPERATION 
{ TWO'S COMPLEMENT 
WITH SHUFT DOWN - 16-BIT SPEED COMPUTATIONS 


- 


ECT CEL ELE 


Figure 3-4. 


OA,0B ABI 


MAGNITUDE ONLY ARITHMETIC OPERATION 
WITH SHAFT DOWN SPEED COMPUTATIONS 


PATHS | PATH2 


GP Creat a 
Caw res ri 
1359, 

Sour 

Yet 

<OT Pe 

serupy 


Figure 3-5. 
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Bits PLO 
through PL11 


Bit PL12 


Bit PL13 


TABLE 9. TIMING ANALYSIS SUMMARY (ns). 


Arithmetic 
Logic w/Shitt 


Two's Complement, 
Anthmetic with 
Shitt Down 


Magnitude only 
Arithmetic with 
Shift Down 


THE MICROPROGRAM STRUCTURE 


The functions of the pipelined (PL) microprogram bits are Ii- 
lustrated in Figure 5 and as follows: 


This is a shared control field. The field is used 

for branching to a microprogram address or to 

load the CCU counter or control bits for VO. 
The shared control field is determined by 
PL12, LOW for branching and counting or 
HIGH for VO control. 

When LOW, enables the WRITE output and 
allows the Q Register and Sign Compare thip- 
flop to be witten into. 


Figure 4b. 


Bits PL14 
and PL15 


Bits PL16 
through PL19 


Bits PL20 
through PL23 


Bit PL24 


Bit PL25 


Bits PL26 
through PL29 


Bits PL30 
through PL32 


PLAMPCA] PLae [ ALU Operand R | ALU Operands 


The CEy and SE cortrol inputs of the Am2904, 
respectively. CEx enables the Micro Status 
Register. enables the Am2904 shift opera- 
tions. 


CCU Next Address. 

CCU Multiplex test select. 

This bit determines the polarity of the incoming 
test signal to the CCU. 


Active LOW Instruction Register enable. 
CCU multi-way branching select. 


Selects the ALU operand sources. , 


RAM Output B 
OBp.3 

Q Register 
RAM Output B 
DBo.3 

Q Register 


H = HIGH X = Dont Care 


Bits PL33 
through PL36 


ba 
i | 
LH 
Ga 
| 
LH 
Ga 


l= LOW 


Bits PL37 
through 40 


4 Y's 's 
mF 7 
oe oe eee | 
L L Hw t 
a oe eer 
a ee 


el 


x! xj 
| xj 
ze 


| 
| 


rie 
er 
xe 


,% 


' 
zie} ze 


rt 


a 
if 


Selects the ALU functions. 


F = R Minus S Minus 1 Plus Cy, 
F = R Plus S Plus C, 

F aS Plus Cy, 

F; = R; EXCLUSIVE NOR S; 

F; = R, EXCLUSIVE OR S; 

F, = Rj NOR S,; 

F, = Rj NAND S; 


i=003 


Pas H = HIGH 


~~” Selects the ALU destination controls. 


=v 
S10g~¥o Ys. 2 > | 
a 


Bits PL41 
through PL44 


Bits PL45 
through PL48 


Bits PL49 
through PL52 


Bit PLS3 


Bits PL54 
through PL59 


Bits PL6O 
through PL63 


Bits PL64 
and PL65 


Bits PL66 
through PL68 


Bit PL69 
Bit PL70 
Bit PL71 


Bit PL72 
Bit PL73 * 


Bits PL74 
through PLB9 
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This 4-bit wide field is used for the A-address 
source. 


* This 4-bit wide field is used for the B-address 


source. 


This 4-bit wide field is the B destination ad- 
dress into which new data {s written. 


Am2903 control input OEy. When LOW enables 
the ALU shifter output data ontd the Y bus. 


Am2904 instruction code field. 
Am2904 shift linkage multiplexer instruction 
code field. 


Am2904 “carry-in” control multiplexer field. 


The CEy, OEcr, GEy-control inputs of the 
Am2904, respectively. 


This bit when LOW, enables bits PL74 through 
PLB89 onto the Am2903 DA Bus. 


When LOW, zeros the carry in’s to the Am2903 
slices. 


When HIGH, enables a status register used in 
BCD calculations. 


When LOW, clears the status register. 
When LOW, enables Am2909/11 registers. 


| 
This field contains a 16-bit constant from m 


crocode that is passed to the Am2903's via 
the DA bus. Constant is enabled by PL69. 


Ip OR 1; OR Iz OR Ig = HIGH, Ten = LOW 


The Am2903 special functions can be select@@by the following conditions: Iq = ty = lp = ly = ty = LOW, len = LOW 
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SOME SAMPLE MICROROUTINES 


The following algorithms are implemented using the Am2903 
Superstices™ and Am2S04 status and shift controt unit. The 
algorithms were developed with the aid of AMDASM on System 
29. Ali algorithms assume values and constants to be initialized 
prior to the entrance of the algorithms. Appendix A relates the 
actual microcode to the microword fields. Appendix B is the 
AMDASM Phase 1 and Phase 2 listings of the microprograms 
ainienre and the definitions of mnemonics. Figure 4b is a block diagram 
awa of the CPU hardware including the Am2904 Status and Shift 
wasnivis': Control Unit from which the microroutines were developed. A 
detailed diagram of the CPU hardware is in Appendix C. - 


SHARED CONTROL FIELD 
cea Ss eee 


11109 6 


Normalization, Singie- and Double-Length 


Normalization is used as a means of referencing a number to a 
fixed radix point. Normalization strips out all leading sign bits such 
that the two bits immediately adjacent to the radix point are of 
opposite polanty. 

Normalization is commonly used in such operations as fixed-to- 
floating point conversion and division. The Am2903 provides for 
normalization by using the Single-Length and Doubie-Length 
Normalize commands. Figure 6a represents the Q Register of a 
16-bit processor which contains a positive number. When the 
Single-Length Normalize command is applied, each positive 
edge of the clock will cause the bits to shift toward the most 
significant bit (bit 15) of the Q Register. Zeros are shifted in via the 
QIOO port. When the bits on either side of the radix point (bits 14 
and 15) are of opposite value, the number Is considered to be 
normalized as shown in Figure 6b. The event of normalization is 
externally indicated by a HIGH level on the Cn+4 pin of the most 
Significant slice (Cn+4 MSS = Q3 MSS ¥ G2 MSS). 


There are also provisions made for a normalization indication via 
the OVR pin one microcycle before the same indication is avail- 
able on the Cn+4 pin (OVR = Q2 MSS¥Q1 MSS). This is for use 
in applications that require a stage of register buttering of the 
Normalization indication. 


Since a number comprised of all zeros is not considered for 
normalization, the Am2903 indicates when wuch 4 condition 
arises. If the Q Register is zero and the Single-Length Normaliza- 
tion command is given, a HIGH level wili be present on the Z line. 
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a) Unnormatized Positive Number. 
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b) Normalized Positive Number. 
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Figure 5. Figure 6. 


The sign output, N, indicates the sign of the number stored In the 
Q register, Q3 MSS. An unnormalized negative number (Figure 
7a) is normalized in the same manner as a positive number. The 
results of single-length normalization are shown in Figure 7b. The 
device interconnection for single-length normalization is outlined 
in Figure 8. During single-length normalization, the number of 
shifts performed to achieve normalization can be counted and 
stored in one of the working registers. This can be achieved by 
forcing a HIGH at the Cn input of the least significant slice, since 
during this special function the ALU performs the function [B] + 
Cn and the result is stored in 8. Figure 9 illustrates the single- 
langth normalize. However, the microcode is shown in Figure 10. 
Microcode for both single and double normalization can be re- 
duced by one step by testing for zero during passing of number 
into Q. 


Nonnalizing a double-length word can be done with the Double- 
Length Normalize command which assumes that a user-selected 
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a) Unnormalized Negative Single Length Number. 
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RAM Register contains the most significant portion of the word to 
be normalized while the O Register holds the least significant half 
(Figure 11.) The device interconnection for couble-length nor- 
malization is shown in Figure 12. The Cn+4, QVR, N, and Z 
outputs of the most significant slice perform the same functions in 
double-jength nonnalization as they did in single-length normaii- 
Zation except that Cn+4, OVA, and N are derived from the output 
of the ALU of the most significant slice in the case of double- 
length normalization, instead of the Q Register of the most sig- 
nificant slice as in single-length nomnalization. A high-level Z line 
in double-fength nonnalization reveals that the outputs of the ALU 
and Q Register are both zero, hence indicating that the double- 
length word is zero. 


When double-length nomnalization is being performed, shift 
counting is done either with an extra microcycile or with an exter- 
nal counter. Figure 13 illustrates the double-length normalize 
flowchart and Figure 14 shows the microcode. 


QREGISTER 


b) Normalized Negative Single Length Number, 


Figure 7. 
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Figure 8. Single Length Normalize. 


Unsigned Multiply 


This Special Function allows for easy implementation of unsigned 
multiplication. Figure 15 is the unsigned multiply flow chart. The 
algorithm requires that initially the RAM word addressed by Ad- 
dress port B be zero, that the multiplier be in the Q Register, and 
that the multiplicand be in the register addressed by Address port 
A. The initial conditions for the execution of the algorithm are 
that: 1) register R, be reset to zero: 2) the multiplicand be in Ro 
and 3) the multiplier be in Ry5. The first operation transfers the 


multiplier, Rys, to the O Register. The Unsigned Multiply instruc- 
tion is then executed 16 times. During the Unsigned Multiply 
instruction, R1 is addressed by RAM address port B and the 
multiplicand is addressed by RAM address port A. 


When the unsigned Multiply command is given, the Z pin of 
device 1 becomes an output while the Z pins of the remaining 
devices are specified as inputs as shown in Figure 18. The Z 
output of device 1 is the same state as the least significant bit of 
the multiplier in the Q Register. The Z output of device 1 informs 
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Figure 9. Single Length Normalize. 


SUN R2.R2.0FF & CONT & SHOLD 31930 29 28 27 2% 2 2% 2B 22 21 20 19 18 17 16 


MAZ & T & CIP & GOTO ABORT 

MAC & T & 10W AO & CJP & GOTO END 

SLN R2,R2 & MAO & T & CJP ONE & GOTO ENO & SUL 
: SUN A2.R2 & MIO & T & CJP ONE & GOTO AGAIN & SUL 

SOOQP & SMS & CONT 

SRS R2,R2.R0 & CONT 


% 413121110 8 8765 € 3 21 6 


Figure 10. . Figure 11. Double Length Word. 
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Figure 12. Double Length Normalize. 


START 
LS NUMBER INO 
MS NUMBER IS AS. 


OOUOLE LENGTH 
NORMALIZE 
ALU DISABLE 
SHOLD 


OOUDLE LENGTH 
NORMALIZE 
Qo 0 


ENO 

NORMALIZED 
LS NUMGER INQ 
MS NUMBER IN R15, 

EXPONENT NUMBER 
IN A2. 


Figure 13. Double Length Normalize. 


OLN R15.R15,0FF & CONT & SHOLD 

MAZ & T & CJP & GOTO ABORT 

LOW R2 & MAC &T & CJP & GOTO END2 

DLN R15,R15 & SOUL & MAO & T & CJP & GOTO JUMP! 


DLN 915.R15 & SOUL & MiO & T & CJP & GOTO JUMP! 
PAR R2,.R2 & JP ONE & GOTO LOOPS 

PAR R2.R2 & CONT ONE 

SORO R15. R15 & SOMS & END 


Figure 14. 


START 
MULTIPLICANO IN RO 
MULTIPLIER IN RIS 


ENO 
PROOUCT (MS) IN RI 
PROOUCT (LS) INO 


Figure 15. Unsigned 16 X 16 Multiply. 


the ALUs of all the slices, via their Z pins, to add the partial product 
{relerenced by the B address port) to the multiplicand (referenced 
by the A address port) if Z = 1. If Z = 0, the output of the ALU is 
simply the partial product (referenced by the B address port). 
Since Cn is held LOW, it is not a factor in the computation. Each 
positive-gaing edge of the clock will internally shift the ALU out- 
puts toward the least significant bit and simultaneously store the 
shifted results in the register selected by the B address port, thus 
becoming the new partial sum. During the down shifting process, 
the Cn+4 generated in device 4 1s internally shifted into the Y 
position of device 4. At this time, one bit of the multiplier will 
down shift out of the QIO9 ports of each device into the QiO3 
port of the next less significant slice. The partial product is 
shifted down between chips in a like manner, between the SIO 
and SIO; ports, with SIO, of device 1 being connected to QIO3 
of device 4 for purposes of constructing a 32-bit jong register to 
hold the 32-bit product. Shifting of the partial product between 
the B address and QO regisiers are accomplished via the 
Am2904. At the finish of the 16 x 16 multiply, the most signif- 
cant 16 bits of the product will be found in the register refer- 
enced by the B address lines while the least significant 16 bits 
are stored in the Q Register. Using a typical Computer Control 
Unit (CCU), as shown in Appendix C, the unsigned multiply 
operation requires onty two lines of microcode, as shown in 
Figure 16, and is executed in 17 microcycles. 


LOPT R15 & F & GRD & PUSH & COUNT O0E 


UMUL R1.R1,R0 & F & CNT & SODL & RFCT 


Figure 16. 
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Two's Complement Multiplication 


The algorithm tor two's complement multiplication is illustrated by 
Figure 17. The initial conditions for two's complement multiplica- MULTIPLIER IN AIS 
tion are the same as for the unsigned rhultiptly operation. The 
Two's Complement Multiply Command is applied tor 15 clock 
cycles in the case of a 16 x 16 multiply. During the down shifting 
process the tern N ¥ OVR generated in device 4 is internally 
shifted into the Y3 position of device 4, The data flow shown in 
Figure 18a is still valid. After 15 cycles, the sign bit of the multiplier 
is present at the Z output of device 1. At this time, the user must oe COMPLEMENT 
place the Two's Co lement Multiply Last cycle command on the MULTIPLY 
instruction lines. The interconnection for this instruction is shown DECREMENT.CTR 
in Figure 18b. On the next positive edge of the clock, the Am2903 
will adjust the partial product, il the sign of the multiplier is nega- 
tive, by subtracting out the two's complement representation of 
the multiplicand. if the sign bit is positive, the partial productis not 
adjusted. At this pomt, two's complement multiplication is com- 
pleted. Using a typical CCU, as shown in Appendix C, the two's 
complement multiply operation requires only three lines of micro- 
code, as shown in Figure 19, and is executed in 17 microcycles. 


TWO'S COMPLEMENT DIVISION 
ENO 
] The division process is accomplished using a four quadrant non- PRODUCT (MS) IN AT 
. restoring algonthm which yields an algebraically correct answer FROSUCE RSH G 
such that the divisor times the quotient plus the remainder equals 
the dividend. The algorithm works for both single precision and Figure 17. 2's Complement 16 X 16 Multiply. 
Fe iB) *Cat Zoo 
FeiBlolAleCyazed 
DEVICES DEVICE? DEVICE 1 
Note: For unsigned multiply, Cn + 4 MSS is internally shifted into position Y3 MSS: 2’s complement multiply NYOVR 
és internally shifted into position Y3 MSS. 
a) Multiply. 
Fe[B) ¢CatZ=o 
Fo (8)-[A]-teCydZot Lop F/2-Y.8 a2z-a 
/ 
I 
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Note: N ¥ OVR is internally shifted into position Y3 MSS. 


b) Ccmplement Multiply, Last Cycle. 


Figure 18. 


LOPT R15 & F & GRO & PUSH & COUNT 000 


TCM R1,R1,RQ & F & CNT & SDDL & RFCT 
TCMC R1,R1,R0 & SODL & CONT CZ 


Figure 19. 


multi-precision divide operations. The only condition that needs 
to be met is that the absolute magnitude of the divisor be greater 
than the absolute magnitude of the dividend. For multi-precision 
divide operations the least significant bit of the dividend is trun- 
cated. This is necessary if the answer Is to be algebraically 
Correct. Bias correction is automatically provided by forcing the 
least significant bit of the quotient to a one, yet an algebraically 
correct answer is still maintained. Once the algorithm is com- 
pleted, the answer may be modified to meet the user's format 
requirements, such as rounding off or converting the remainder 


50 that its sign is the same as the dividend. These format modifi- 
cations are accomplished using the standard Am2903 instruc- 
tions. 


The true value of the remainder Is equal to the value stored in the 
working register times 2°—' when n is the number of quotient 
digits. ~ ; 

The following paragraphs describe a double precision divide 
operator. 


Referring to the flow chart outlined in Figure 20, we begin the 
algorithm with the assumption that the divisor is contained in 
Ro, while the most significant and least significant halves of 
the dividend reside in A; and Ry, respectively. The first step Is 
to duplicate the divisor by copying the contents of Ag into A>. 
Next the most significant half of the dividend is copied by 
transferring the contents of R, into R2z while simultaneously 
checking to ascertain if the divisor (Ro) is zero. If the divisor Is 
zero then division is aborted. if the divisor is not zero, the 
copy of the most significant halt of the dividend in R2 ts con- 
verted from its two's complement to its sign magnitude rep- 
resentation. The divisor in Ry Is converted in like manner in 


QUOTIENT IN Q 
REMAINDER IN AI 


Figure 20. Two's Complement Division. 
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the next step, while testing to see if the results of the dividend 
conversion yie'ded an indication on the overflow pin of the 
Am2903. If the output of the overflow pin is ‘one’ then the 
dividend is -2" and hence is the largest possible number, 
meaning thit it cannot be less than the divisor. What must be 
done in this case is to scale the dividend by down shifting the 
upper and fower halves stored in R, and Ry respectively. Atter 
Scal:ng, the routine requires that the algorithm be reinitiated at 
the beginning. 


Conversely, if the oulput of the overtiow pin is not a one, the sign 
Magnitude representation of the divisor (R3) is shifted up in the 
Am2203, remaving tne sign while at the same time testing the 
results of two's complement to sign magnitude conversion of the 
divisor in the Am2910. Il the results of the test indicate that the 
divisor is —2" f.e., overflow equa!s one, then the lower half of the 
dividend is placed in the Q register and division may proceed. 
This is possible because the divisor is now guaranteed to be 
grealer than the dividend. If overflow is not a one then we must 
proceed by shifting out the sign of the sign magnitude represen- 
tation of the dividend stored in Ro. At this point we are able to 
check if the divisor is greater than the dividend by subtracting the 
absolute value of the divisor (R3) from the absolute value of the 
upper half of the dividend (Ra) and storing the results in R3. Next, 


the least significant hall of the dividend is transferred from Ry to - 


the O register while simultaneously testing the carry from the 
result of the d:visor/dividend subtraction. If the carry (Cn+4) is 


Fe [Bl oly log 2F -¥.8 


Fe (B) ¢ (Al *CalzZ-0 
F+(B]- (Al —-IeCatzZe=t 


SIGN COMPARE FF 


one, indicating the divisor is not greater than the dividend then a 
scaling operation must occur. This involves either shifting up the 
divisor or shifting down the dividend. If the carry is not one then 
ihe divisor is greater than the dividend and division may now ° 
begin. 


The first divide operation Is used to ascertain the sign bit of the 
quotient. The two's complement divide instruction is then exe- 
cuted repetitively, fourteen times in the case of a sixteen bit 
divisor and a thirty-two bit dividend. The final step is the two's 
complement correction command which adjusts the quotient by 
allowing the least significant bit of the quotient to be set to one. Al 
the end of the division algorithm the sixteen bit quotient is found in 
the Q register while the remainder now replaces the most sig- 
nificant half of the dividend in R,. It should be noted that the 
remainder must be shifted down fifteen places to represent its 
true value. The interconnections for these instructions are shown 
in Figures 21, 22, 23. Using a,typicai CCU as shown in Appendix 
C, the double precision divide operation microcode, is shown in 
Figure 24, 


For those applications that require truncation instead of bias 
correction, the same algorithm as above should be implemented 
except one additional Two's Compiement Divide instruction 


, Should be used in lieu of the Two's Complement Divide Correc- 


tion and Remainder instruction. However, this technique results 
in an invalid remainder, 


Figure 22. 2's Complement Divide. 
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LOW R10 & JSR & GOTO InP 
PAR R7,R15 & JSA & GOTO INP 
PAR At.R15, & JSR & GOTO INP 
PAR R4,R15 & CONT 

PAR R3.R7 & CONT 


PAR R2.Rt & T & MIZ & CJP & GOTO ABORT 
SMTC A2,R2 & CONT Z 

SMTC R3,R3 & T & MIO & CJP CZ & GOTO SCALE 
ALUOFF & T & MIO & CJP & GOTO SKIP6 
SURL A3.R3 & SUL & CONT 

SURL R2,R2 & SUL & CONT 

ALUOFF & JP & GOTO LOOP2 

LOPT R4 & JSR & GOTO SDIVD 

ALUOFF & JP LOOP! 

SSR R3,R2,YBUS & CONT ONE 

LOPT R4 & F & MIC & CJP & GOTO SKIPS 
ALUGFF & JSR & GOTO SOIVD 

SURL R2,R2 & SOL & CONT 

ALUOFF & JP & GOTO LOOP2 

ALUOFF & F & GAD & LDCT & COUNT 00C 
DLN R1.RT.R7 & T & GAD & SOUL & PUSH 
TOIV A1.A1,R7 & F & CNT-& SOUL & RECT CZ 
TOC R1,R1,R7 & SUH & CONT CZ 

QMOV R15 & JSA & GOTO OUTP 

PAR R15,R1 & JSR & GOTO OUTP 

ALUOFF & JP & GOTO DIV 

PAR R1,R1 & CONT 

ALUOFF & T & MIS & CJP & GOTO NEG 

PAR R1,R1,AORQ & SOOL & CONT 

ALUOFF & JP & GOTO RET 

PAR R1,R1,AORO & SOOL & CONT 

QMOV R4 & CONT 

PAR A10,R10 & ATN ONE 


Figure 24. 


NON-RESTORING BINARY ROOTS 


The algorithm for Non-Restoring Binary Roots is illustrated In 
Figure 25. The initial conditions required are: 1) the non-negative 
number to be rooted in the radicand register, R,; 2) Re has the 
positive append bits 101_; 3) Ry has the negative append bits 
0115; 4) Ry is the mask register with BFFF,,; 5) As is the partial 
register with 4000,; and 6) the counter register, Re, with the 
value 08};. . : t 
An example of the Non-Restoring Binary Root algorithm is shown 
In Figure 26. Starting at the binary point, the number to be rooted 
is partitioned into pairs: The partial value is subtracted from the 
first pair. An intermediate remainder and sign are then produced: 
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-If the remainder is positive, a 1 is entered in the corresponding 
foot bi. Then a 01 is appended to the partial, shifted and sub- 
tracted from the present remainder to produce the next remain- 
der. When the remainder becomes negative, the present remain- 
der Is not restored. A 0 is entered in the next corresponding root 
bIL Then an 11 is appended to the partial, shifted and added to the 
present remainder. The entire process is repeated until the 
partial root has developed into 8 bits or the remainder is zero. 


Refemng to Figure 26, the same method of finding the root 
applies. A starting partial value, Rs, is subtracted from the 
radicand, Ry, which produces the intermediate remainder Ao. 
During this time, the sign of the remainder is stored within the 
Am2904. Then Rg is masked by Ry to obtain the next partial value 
and R, is shifted to obtain a new mask for the next cycle. Status is 


+ obtained from the Am2904 and tested. If the ramainder Is posi- 


tive, a root bit of 1 is developed and bits 01 appended by Ro. 
When negative, a root bit of 0 is developed and bits 11 appended 
by R3. At this point Re is decremented and tested for zero. If Rg # 
0, then addition or subtraction is performed on the remainder 
depending on the sign bit stored in the Am2904. A new remainder 
is produced and cycled through the procedure again. Figure 27 
illustrates the microcode. 


BCD HARDWARE ADDITIONS 


In applications where fast BCD operations are needed the de- 
signer has the option of using a slight amount of additional 
hardware to dramatically increase the performance of these op- 
erations. These firmware/hardware trade-offs are very applica- 
tion sensitive. The hardware-firmware examples given below are 
specifically for an intensive BCD system with a large fraction of 
conventional fogic-arithmetic operations. The designer is willing 
lo reduce cycle time slightly to increase BCD thru-put. Small 
hardware additions are acceptable as long as fleubility is re- 


~ : Figure 26. Non-Restoring Binary Root Example. 


LOW R10 & CONT 

LOW RO & CONT 

PAR R1,R15 & CONT 

PAR R2,R0,,DARB & CONST 0005 & CONT 
PAR R3,R0,.DARB & CONST 0003 & CONT 
PAR R4,R0,,DAR8 & CONST H@#BFFF & CONT 
PAR R4,R0,,DARB & CONST 4000 & CONT 
PAR R6.RO,,DARB & CONST 0008 & CONT 
SAS RO,A1,A5 & CONT & SHOLO 

ANO AS.RS.R4 & CONT 

SDAL A4.A4 & MAS & CJP & GOTO END3 
SDAL RO.RO, & T & MAS & CJP & GOTO POS 
OR RS.A3 & JP & GOTO CNT 

OR R5,R2 & CONT 

SAS AG,AG,AIO & CONT 

SDRL R2,R2, & T & MIZ & CJP & GOTO ENDS 
SDRL R3.R3 & T & MAS & CIP & GOTO SUB 
ADD RO,RO.AS & JP & GOTO CYCLE & SHOLD 
SAS RO,RO.RS & JP & GOTO CYCLE & SHOLD 
JP & GOTO SORT 


Figure 27. 


The hardware additions finally decided on were chosen to in- 
crease the performance of BCD to binary conversion, binary to 
BCD conversion and BCD addition. The performance increases 
were approximately an order of magnitude in the first two cases, 
and a factor of 4 or 5 in the last case. A diagram of the additions 
(3% ICs) is given in Figure 28, 


The 74808 AND gates normally pass the carry from the 
Am2902A to the Am2903s. When microbit CZER is low the 
Carries-in are forced to zero. This is used to “disconnect” the 
carry so that atest may be done on each slice simultaneously. For 
example if a test for 5 or greater is desired a HEX B is added and 


a ee ee ee SS ee 


Figure 2B. 


the carry out of each slice will indicate the result of the test. This 
allows simultaneous tests on each individual slice and greatly 
increases thru-put. This addition increases the performance of 
BCD to binary conversion and binary to BCD conversion by at 
least an order of magnitude. The drawback to this addition is that 
the AND-gates introduce an extra gate delay in a critical path. The 
machine cycle time may be increased by about 8ns. The increase 
in BCD performance will more than offset this delay for BCD 
intensive systems. 


Another hardware addition is the Am25LS241 three-state buffer. 
This butter allows the Am2904 to be used to store the carry-out 
Status bits via the bi-directional Y bus. 


The 25LS195A is wired as a 4-bit register with clear and enable. 
This register is used to store the carry-out bits from a test cycle. 
The outputs of the 25LS195A are ORed with the output of the 
Am2904 Y-bus and connected to the Am2909 OR inputs in the 
CCU. This allows a multi-way branch on the OR of two test 
cycles, greatly increasing the performance of BCD addition. 


BCD TO BINARY CONVERSION 


The usual method of BCD to binary conversion is to divide the 
BCD number by 2. The 1-bit remainder will indicate if a 1 


existed In the BCD number. The previous division result is di- © 


vided by 2 again and the remainder will indicate if a 2 existed 
in the BCD number. In general the remainder from a division 
by 2" will indicate if a 2°~" existed in the BCD number. 


These remainders can be used to construct the binary rep- 
resentation, by2 + bp_12"7) + bp—22"7? + 22. + D427 + 
bp2°: The b,, bit is thus the remainder from division step n + 
1. The binary representation may thus be created by shifting 
the remainders down until the m-bit BCD number has been 
divided by 2 m times. . 
To divide a BCD number by 2 a down shitt is executed. The 4, 2 
and 1-bit positions will contain the correct result, but the 8-bit 
position is incorrect. its value has changed from 10 to 8 instead of 
from 10 to 5. This means the resulting BCD number will have a 
value 3 greater than it should for the division by 2 to be correct. A3 
must be subtracted from any digit in which a 1 entered its 8-bit. 


A sample conversion fs givén In Table 10. The BCD number is 
gradually shifted down and corrected when necessary. The bi- 
nary number is finally correct ater 16 cycles. 


A flow diagram for the algorithm is given in Figure 29. The BCD 
input, A, is shifted down into the binary output B, to start the loop. 
The constant 0888 is added to A with the carries-in forced to zero. 
The resulting caries-out will indicate if A contained a 1 in any of 
the 8-bit positions. These caries are saved in status register 
SR1. A multi-way branch is then executed to enter the adjust 
table. The digits are adjusted depending on the previous test. At 
the same time a shift can be executed to prepare for the next test 
instruction. A test for end of loop Is also done in this cycle to 
provide an exit if 16 iterations of the loop are complete. Finally a 
shift up of B is needed to cancel the extra right shift when the loop 
is exited. The microcode for this algorithm is given in Figure 30. 
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Digit 
1 


0000 
1000 
0101 
0010 
0010 
1001 

0110 
1011 

1000 
1100 
1001 

0100 
0100 
0010 
0010 
0001 

0001 

0000 
0000 
0000 
0000 
0000 


ESESESSRERSEEERERE 


PEEEEECEEEEES 


Dight 
0 


0100 
0010 
0010 
1001 
0110 
0011 
0011 
0001 


TABLE 


10. 


BCD —=Binary 
Result Operation 


011000 


_1011000 


SHIFT 

ADJUST DIGIT 4 
SHIFT 

ADJUST DIGITS 2, 0 
SHIFT 

ADJUST DIGIT 1 
SHIFT 

ADJUST DIGIT 1 
SHIFT 

ADJUST DIGIT 1 
SHIFT ' 

ADJUST DIGIT O 
SHIFT 

ADJUST NONE 


01011000 ~~ —s—é« SHIFT 


ADJUST > | NONE 


101011000 SHIFT 


ADJUST DIGIT Oo 


1101011000, SHIFT 


0110101 


1011010 


_» ADJUST _ — NONE 
1000 | SHIFT 
: ADJUST NONE 
11000 SHIFT | 
\ ADJUST NONE 


. 0101101011000 SHIFT 


ADJUST ~ NONE 


00101101011000 SHIFT 


\ ADJUST NONE 


000101101011000 SHIFT 


ADJUST NONE 


0000101101011000 SHIFT 


Figure 29. BCD to Binary Conversion (16 Bits to 14 Bits). 


ADJUST = NONE 


BINARY TO BCD CONVERSION 


A method very similar to the one used for BCD to binary conver- 
sion may be used for binary to BCD conversion. The BCD number 
is created by shifting the binary number up, into a partial BCO 
result. The BCD number is adjusted to provide a multiplication by 
2. The shift adjust process continues until the least significant 
binary bit is shifted into the BCD result. 


The adjustment is needed when a 1 is shifted from the 8-bit 
position to the’ 1-bit position of the next digit. the value has 
increased from 6 to 10, instead of from 8 to 16. To correct this a6 
must be added.to any digit that has a 1 shifted out of its 8-bit 
position. Akemately a 3 could be added before the shift to any 
digit that has a 1 in Its 8-bit position. : 


Another correction is needed whenever an invalid BCD digit is 
encountered. If a number greater than 9 is detected in any digita 
10 must be subtracted from that digit and a 1 added to the next 
highest digit. The same correction can be accomplished if a 6 is 
added to the invaiid digit after the shift. To correct before the shift 
a 3 Is addea to any digit which contains a 5, 6 or 7. These 
adjustments are summarized in Table 11. Both adjustments may 
be accomplished by adding a3 to any digit which is greater than 4. 


Table 12 shows an example conversion. The binary number is 
gradually shifted up and the BCD partial result adjusted. After 14 
iterations the conversion is complete. A flow diagram for the 
algorithm is given in Figure 31. 


ENR & COUNT LOOP & CONT 
PAS RO, RO LDRO & SOOL & LOCT & CONST 15 

ADD R1, RO, RO, DARB & ALUOFF & CONST 0888 & CZERO & ENSUAI & CLSR2 & RPCT 
ALUOFF & MULTI BWAY : 
ALIGN 8 

ALUOFF & CJRP & CNTR & GOTO EXIT 


SUB RO, RO, RO, LORQ,DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
Sus RO, RO, RO, LORQ,OARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
SUB RO, RO, RO, LORO,DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
SUB RO, RO, RO, LORO,DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
SUB RO, RO, RO, LOROQ,OARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
SUB RO, RO, RO, LORQ,DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
SUB RO, RO, RO, LORQ,DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
PAS RO, RO, RO, LURQ & SDUL & RTN 


SOG , a 


TABLE 11. 


-Q - 


' 
‘ 


Mllegal BCD 


Shift Thru 
Correction 


2 


Initially the 14-bit binary number Is left justified by two shift up 
-operations. To start the loop the binary input, B, is shifted up, into 
the partial BCD result, A. The constant BBBB Is added to A, with 
the caries-in forced to zero. The resulting carries-out are stored 
in status register SR1. A multi-way branch is used to enter the 
adjust table. The digits are adjusted depending on the result of the 
previous test. In the same instruction a shit is executed to pre- 
pare for the next test cycle. Additionally an end of loop testis used 
{o provide an exit it 16 iterations of the loop are complete. Before 
the exit a fix-up cycle is used to cancel the extra shift executed in 
the loop. The microcode for this algorithm is given in Figure 32. 


BCD ADD 


One method of performing a 4-digit BCD add Is to do a 16-bit 
binary add, with the carres-in forced to zero, and adjust the 
resulting sum. The adjustments are necessary to change invalid 
BCD digits to valid BCD digits. When an invalid digit is modified 
@ carry to the next highest digit Is generated. This could cause a 


-) o 14, 99,11, 11 
—0 


CARRIES-IN 


=A 
SR1— CARRIES-OUT 


Figure 31. Binary to BCD Conversion (14 Bits to 16 Bits). 
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Q: =Binary Input 
RO: = 8CO Result 


SURL RO, RO & Sut & CONT 

SURL RO, RO, & SUL & ENR & COUNT LOOP & CONT 

PAS RO, RO, ,LURO & SOUL & LOCT & COUNT 15 

ADD R1,R0, RO, DARB & ALUOR & CONST BBBS & CZERO & ENSR1 & CLSR2 & RPCT 
ALUQFF & MULTI 16WAY 

ALIGN 16 

ALUOFF & CJRP & GOTO EXIT , 

ADD R1, RO, RO, LURO,DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURO.DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
ADD Ri, RO, RO, LURO.DARB & CONST 0003 & CURP & CNTR & GOTO EXIT 
ADD Ri, RO, RO, LURO.DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
ADD R}, RO, RO, LURO,OARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
ADO R1, RO, RO, LURO.DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
AOD 1, RO, RO, LURO,OARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
ADD Rt, RO, RO, LURO.DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURO.DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
ADD Rt, RO. RO. LURO,DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURO,DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
AOD R1, RO, RO, LURO,DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURO,DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURO.DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
ADD R1, RO, RO, LURO.DARB & CONST 0003 & CJRP & CNTR & GOTO EXIT 
SDRL RO, RO, & SOL & RIN 


Figure 32. Binary to BCD Conversion Microcode (14 Bits to 16 Bits). 


TABLE 12. 


Digit Digit Digit Binary—— BCD 
2 Conversion 


00101101011000 
0101101011000 


101101011000 
01101011000 
1101011000 
101011000 
01011000 


1011000 


011000 


11000 


1000 


me) 
previously valid digit to become invalid. The word must be 
checked and modified untii all digits are valid (up to four modifi- 
cation cycles could be necessary). 


Initially the two BCD numbers are added with the carries-in to 
each digit forced to zero. The carries out are saved. Next the hex 
number 6 is added to the sum, with the carries-in forced to 
zero, and the resulting carries out are saved. This tests each 
digit for validity, a carry-out indicating an invalid BCD digit 


D—A + 68,66; 
CARRIESIN—— 0; 
SF2—— CARRIES-OUT; 


Figure 33. BCD Add. 


(greater than 9). if a carry was generated in either cycle a 6 is 
added to the invalid digit, with carries-in forced to zero, to create 
the valid BCD digit. Additionally a 1 must be added to the next 
highest digit to provide the BCD carry-out. Each time a digit is 
adjusted the carry-out may invalidate the next highest digit. Thus 
adjustment cycles must be followed by validity tests until all 
digits are valid. A flow diagram for this algorithm is given in 
Figure 33. The microcode for this algorithm is given in Figure 34. 


A = Rt 
8: = RO 


1 ADO R1.R1.RO & CZERO & ENSR1 & CONT Z 

2 ADD A1,A1.R0,.0AR6 & ALUOFF & CZERO & ENSR2 & CONST 6666 

3 ALUOFF & MULTI IGWAY & RMAC 

ALIGN 16 

4 ALUOFF & JMP & GOTO EXIT & ENSAI 

5 ADO R1,A1.R0,,DARB & CONST 0016 & GOTO LOOP & CLASAt 

6 ADO A1,A1,RO,.DARB & CONST 0160 & GOTO LOOP & CLASAI 

7 ADO R1,R1.RO,.0ARB & CONST 0176 & GOTO LOOP & CLRSAI 

8 ADD Al.R1.RO,.0ARB & CONST 1600 & GOTO LOOP & CLRSAI 

9 ADD AS.At.RO,.OARB & CONST 1616 & GOTO LOOP & CLRSAI 

10 ADD R1.A1.A0,.0ARB & CONST 1760 & GOTO LOOP & CLASRI 

11 ADO AI.R1,R0,.0ARB & CONST 1776 & GOTO LOOP & CLRSRI 
12 ALUOFF & JMP & GOTO LOOP & SMAC & CLRSA! 
13 ADD R1,A1.A0,,0ARB & CONST 0016 & GOTO LOOP & SMAC & CLASRI 
14 ADO A1,R3.R0,.DARB & CONST 0160 & GOTO LOOP & SMAC & CLRSRI 
15 ADO A1.R1.A0..0ARB & CONST 0176 & GOTO LOOP & SMAC & CLASRI 
16 ADO A1,A1.RO,.DARBS & CONST 1600 & GOTO LOOP & SMAC & CLASRI 
17 ADO A1,A1.RO,.0ARB & CONST 1616 & GOTO LOOP & SMAC & CLRSAT 
18 ADD AT.A1.R0,.0ARB & CONST 1750 & GOTO LOOP & SMAC & CLASAI 
19 ADD A1,A1,R0,,0ARB & CONST 1776 & GOTO LOOP & SMAC & CLRSRI 


ext: 


Figure 34. BCD Add Microcode. 


SUMMARY ‘ Ss ct 2 


In this chapter, a detailed description of the Am2904 was pre- 
sented, along with an example timing analysis. Several! micro- 
code algorithms were discussed to show how the Am2904 oper- 
ates In a 2903 based CPU. As can be seen, the Am2904 provides 
a powertul, single-chip LSI solution to the shift multiplexer, status 
register, and carry multiplexer design portion of a CPU using 
either the Am2901B or the Am2903. 


The Appendix includes a full microcode listing. The interested 
reader is encouraged to study these listings to gain a better 
understanding of the hardware organization (Appendix C). An 
additional microcode listing (Appendix B) gives the AMDASM™ 
definition file and source file for the microcode. The reader should 
study these listings while referring to the AMDASM Manual. (The 
Am2900 Family Data Book contains an AMDASM Reference 
Manual, document AM-PUB003., 4-78 FRODO.) 
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APPENDIX A 


Am2904 CONTROL FIELD 


COMMENTS 


a -=“1F} 


12 
3 
wn! 
ri 
o 
Ys) 
~ 
wy 
| 
rs) 
a 
° 
oa 
F 
n 
) 
a 
3 
te 


66 


[67 


CONSTANT 


ADDRESS 


int 
Ld 


89 88 87 86 85 84 B3 B2 BI BO 79 78 77 76 75 74)73 72 71 70/69/68 


UNSIGN 


MULTIPLY 


oo 
xx 
x x 
xx 
xx 
xx 
x x 
xo 
x= 
x= 
xo 
oo 
oo 
x x 
xx 
xx 
x x 
<x 
xx 
xx 
xx 
ux 
“x 
xx 


2-4 


x x 
xx 
«xx 
<x 
xx 
xx 
xx 
<x 
xx 
xx 
xx 
xx 


oa 
oo 
-o- 


oo 


oco 
x x 
xx x 
MxM 
x x 
x MM 
ba ietel 
xoo 
xen 
x= 
xao 
oo 
oo- 


«Mu. 


xe « 
«x «Kx 
xx x 
xxx 
x «x 
x «Kx 
xxx 
x x x 
x x x 


x x MM 


«xxx 
x x x 
xx x 
xxx 


~— 3 3 2S 


TWO'S 


x x 
x xx 
x «x 
x << 
xx 
x 
xxx 
x x x 


COMPLEMENT 
MULTIPLY 


TWO'S 


‘ 


eovs9evcOoOxcoxoxooxoxxoo0o0c0oxoxoxacge 
xxXxXXKKOKOOXKMXxM MM KOM MM KKM KKM KM KOM MK KK 
HO KO MK OO OK KX 
MMMM Me KEK MK KKK KOR KM KM MK MK MM MK KK 
x MM KOK GOK MMMM KM KRM MM KK KK Km OK KK 
HC KE OK OE OO OK KK 
xxx xX KOKXOOKXMX KX MOM MMMM MK KKK KOK K KKK 
MM MK MM KK KOOKK KKK KOM KR RK HK XK KK KOK OK XK 
MMM MMMM MRK KKK KKK OM MK HK KKK KM KK KO KK 
xX KM KK KOOMK KM XK ORM RK HK OR KKK KKK KKK 
XM XK KM KM KOOKMKM XK MX OM KM RK HK OX KKK KOKO X 
(-0-R-e-e--e-B---- - -o ---- 
oo000oo--GcG00GCCCCeOCCeCeoOoOK--o0c0c0 c0 00000 
OK KOK OOOO OK OO KK 
XM XM OX OCOMMN KX KOM MK KKM MM KK KOM KKM 
2 WO KOO OKO KO KK 
KK KK KK KKK KK KKK KKK 
HK HK HEE HE KE KE HEE HK HK KO HOHE OKO KK OK KO KK 
eee eee eee eS ee eee eee 222 2 2 2 2 eee eel 
eee eee eeee eee eee eee 22 2 2 2 eee eel 


YEE HEHEHE HEHEHE KK DE SEE HE EE DE CEE EE HK KK 
MM KK MRK MMMM KKK KKK KKK KK KKK HK MK KKK 
oRerererrererererereree re Cee ee eee eee 22 
SErereerrerrerrreree eee eee eee ee ee ees: 
EoLereerrrrerrrree ef ot eer ee eee ee es! 
SoLeeerreererrreerererereer eee ee eee ee ee: 
Sorcerer ererrrrrere es C2 eee eee Se Se eee! 
SOCEreererrererre ere reer eee eee eee eet) 


2.22222 tee eee eee eee eee eo eS 222 oe! 


ieee eee eee ee ee eee eS et £222 22 2 2 222 
eee e eee eee eee eee ee eee eee 22 2 2 2.5.2.2.) 
HK HK HK HK KK KKK KK MK MK HK HK HK HK KOK HK KK KK 
Pe eee e TEC SCeOCeCeereeere eee ee eee ts tt.) 
Pe eeee tee ee ereee eee ee ee 22 2 2 2 2.2.2.5.) 
ieee eee eee eee ee ere r eee ee 2 2 2 2 55.22! 
’ HE HK HE HK HK HE HK KEE EK EE HEHE HK KK HEHE HK KM HK KOH HK KEKE 
Pet eeeceeeeceeeeeeeeee eee ee eee 2 2 22! 


SCALE 1: 
SKIP 6. 
SKIP 3. 


= OY] 
8 a 
g g 
MKMDOOWLOKNMNCNHNOROMCKCMDOOWLOKNO THOR OA 


PKR KKK KE NNNNNNN NNN NNO 


Oe ll ol ol eae 


--e----0- 0-0-8 -- - - -- - -- 


COMPLEMENT 


OIVIDE 


SHARED CONTROL FIELD 


' CCU CONTROL FIELD . 


: ° ox -xx 

-| . x _OxMx 

ae a} =x Ll ee 
5 o ex eux 
aye v ox _Oxx 
ee “| ox ~Oxx 
58 o| ox dole 
z n ox —exx 
“leo ox oxMx 

a ox > 5! oxx 

we S ox’ ~“oOxx 

= .ox “Ooxx 

so373S atsy]S[ = x “xx 
EN Ey oo ooo 

N3 SNivis| =| oo. ooo 
N3 131HS| Y} xo xoo 

> g 2 .~9e0. coo 
sky SS]. .©e0-,.0ce0e0 
£26 |e] -o -..-c0 
< a o- oer 

g O-, Ore 

N OE oe 

8 o- e Ore 

g om: ‘owe 
[mnvioa|z|__o o_o ox 
[wauilg[— = 
g oo ooo 

RI coo ‘coo 

: oo “eco 

HH co ooo 

co ooo 

E co ‘ooo 

o oo ooo 

oo, -coo 

oe) ~a- eo 

: -o |. 9° 

oo ooo 

at oo ooo 

B 18 -o oor 

g 2/2 -o -o- 
2S oo ooo 

'e E -o -o°o 

w -o “seo 

2 : --.oO —-oc 

< -o --oo 

2 = x Ke 

w : xo xoo 

8 § xo xoo 

2 xo xoo 

wale, 22 3 - ee 
oth 3 xo xoo 
836 a xo xoo 

< g xo xoo 
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COOXKXOXK$OX KKK KX KOK XK OXKKOOK KK KOKKK 
COOXXK§—XOOXX KK XKXOKKXKOXXKOOOX—KOXXKX 
COOXX HK XK OXKXKOXKK OX KK KKK KK OK HK KOKKX 
COOXXOXO—XKXOOXXK-OXOK—XKXKXOOK— XOX KKK 
COQXXOXOCOXKOK—XXO-—KXOOXXKOOK XH KK KKK 
COOXX HK KKK KKK KKK KK KK OKXKXKOOOKX Ke XK KKK 
COOXX-§XOOXXOOXXOOXOOXXKXOOCOXOX OX XX 
COOXKXOXCOXXOCOXXOOXOOXXKXOADOXOX OX XX 
KR RK KK KR KKM RK RK KKK KKK OKKK KR KKK KKK KKK 
COOXXOXOCOXXOOXXOOXOOXXXODOXOXOXXX 
COOXKXOXOOXXDOXKXOOXOCOXXKXODDOXOXOX XX 
COOxXXOxOOxxXxAaxxXxO0Ox00xxxXxO0OxX Ox Ox xx 
wee XK RK OR KR KK RK KKK RK KKK KKK KK KK KK KK KKM 
CeCCCOCCOO-CO-o-cCO-0o--x0000-0-0-000 
CCCOCOC COCO COC OCOC OC OCC OOXTSGSDCCC OC COCO 
xx MX KKK XK OOX KKK XK OX MKOOXKXXKOXOXX 
wer CO-9--0O---0O--0-Ox00---0-O0-000 
COCe ree Ke Ke Ke KK OK KK OK HK OKXOK- COOK KR me meee 
eooorWror oor er - On K-OoOOrnKr eK XOn~-OOK- HK Or wK- KK So 
CO0O--O-00O---0--00---x--00O-- Oe ere ee 
COOXX—xX——XXXAXXK-OXKXOX-KOOXX—XXXXO. 
COOXX XK —K—KXXKXOXKXKOXKOX-§-KOOXX HK XXXXKO 
COOXXOXOOXXXOXXOOXXOX=-XOOXKOXxXxxO 
COOXxX-—xX——xXxXKXOXKX-OXKOX—-XOOXX-—XxXxXxXO 
COOXX—XH—— XXX OXKXOOXXOXOXOOKXK— xX KXXX 

a a 

-w-e-e-e ee 

-e-e-E-E-E-E-B- 0-0-0 2-2 
e-e-e-R-e-E-O--O-E-----0-0---- - - -) 
e-e-e--e-E-e-E-E-- --B---------------- 
KCOCOCCOCOCOXOOXOXODOXOXXOODOCSOXOXOXO00 " 
xCCOCCOCOCOXxCOXOxCOxOxxOC00-OxOxOxXxO-0 
KCOCOCOCOCCOXx0OXxOxDOxXxOxxDODD0OXOXOXO00 
eoocoeoeoeOx--xOxCOox-xxOOD00OXOXxOxXxC00 


(Ore ee Kc OoxoOoxe-xX—-XOxxCD0O-xX—KXOX-Ge— 
Oren er HK OOX HK KKK KOK XK XXOOCOKK XK KOX KKK 


K-COCKCTOCCOXCAXxOxDOXxOxxDDDO0OXOXxOxXxO00 
we wr nen nm Ke KE XK KE KOXOOXK XKXOOCOK HK KKH XK— KOK HK 
wwe KKK COXOCOX KX OK KXOXKX KOK KK KKH KH Kee w 
were en Ke Ke KOOK KKH KK OXKOR KR KE KKK KOX ORK 
weer KK COX KK XOXKOXOXKX KEK KKK KK KOKXOKK 
Mer eK KM KKK KKKOXOOX XX KK KK KKK KKK KK KO 
Meer Kn HK OKXKKKXKOXK OXKX KK KK KOXOXOXOX = 


"Kew eK Onxx XK KK RK KOK KKK KK KK KOKXOXOXOXS 


xe KK COKXXXXXXDKXOOXXXXOOCOXOXOXOXOX— 
MMM KKK ORK KOR KKK KK OK KK KK KM KKK KKK KK 
KEM K KK RR KR KR KKK KKK HK KK OOO MK MK KM MK KK 
XXX KKK OOXOOXX XOX XOX X OOO XX KKK KK KK X 
xx KKK OOKXOCOXXXOXxXOXKXOOOX KK KKK KK KX 
O- KOK COOK KK OXKXK HK KKXOXKX KE KK KK KKK HK KOO 
we KOO Km mem Ke KK KKK MK KKK KKXOOOHK K KXAXOXOOe 
O-e-C0COCOXOOXXX—-xXxXOXXOOO--xXOxXxOxXxO-O0 
-CCOCOCCOXOOXKXX—-XxXOXKOOCOK--xXOXOXOO— 


162 


APPENDIX A 


Am2904 CONTROL FIELD 


' 6 
G 
89 88 87 86 85 84 B3 62 B1 BO 79 78 77 76 75 74173 72 71 70169|68167166 165 64/63 62 61 60159 58 57 56 55 54/53 


COMMENTS 


5 


CONSTANT ’ 


ADDRESS | LABEL 


oxoooco 
xooooxx 
xOr-— xx 
xerwrnr Ke XK 
xooo0oxx 
xXwrrr xx 
KOCOOXX 
xxx xX «KO xX 
xx MK MO XK 
xx xx xX Ox 
xx xxx Ox 
eooor-rocoo 
oooooco 
x KKK KKK 
xooooxx 


MK K KK KK 


x «KKK KKK 
x «KM KK KO 
MM KM KK 
x KKK KK 
x KM mK KK 
x «KKK KK 
«KM KK KK 
«eM «KKK KK 
MMM MK KK 


x «KK KK XK - 


x «KKK KK 
mK KKK KK 
Mam K KK KX 
KM KKK KK 
x KK KK KK 
MR KM KK 
MMM KK KK 
x mM MK KK 
MxM KM KKK 
MMM KK KK 
xa K KKK 


SINGLE 
LENGTH 
NORMALIZE 


oxoooococo 
xoooOoxxx 
xOnrerw XK X 
xerwrnr KKK 


"«KOOOOKxX x 


< 


Kew e - x xK XK 
xCOCOCOOXxXx 
xx xXxOOXxXxX= 


MXR eK x KO 


xx KKK KK 
xx xoOOxXxxs 
ooooorro 
eoooooooco 
RK KK KKK 
xOooooxKxx x 
uM K KK KK KK 
xx KKK KK 


— MM KM KK KK 


x KK KK KK OK 
x KKK KKK OK 
KKK KKK KK 
x KM KKK KK 


“KKK KK KO 


xx KK KK KK 
x KM KK KKK 
x «KKK KK KO 
MM KK KK OK 


xxx KX KKK. 


MK KKK KK 


KKK KKK 
- RK KKK KKK 
_ 6K KK KK 


DOUBLE 
LENGTH 


Mm KK KK KK 
MK KK KK KK 
x «KM KKK KK 
xx MK KKK 
xx «KK KK KOK 


LOOP4 
JUMP1 


@a<nmnoowuw 
wzwT7TTwTWVWT YW 
werwrereree 


ceoooooco 


NORMALIZE 


cooooo0oo0oo0oo0o0o0o0o0ceece0c0co0ceo 


xX XKKKKKXOOKX 
KK KKK KK KR OM OK 
KKK KK KK KO KX 
x KX KKK KKK KR RK KK 
x XX KKK KKK KOO XK 
xX KKK RKO EK 
KK KKK KKK KH 
HK KOE EOE EEK 2K 
tere ee te ee 2 
a. tt 2 2 2 2 2 2 2 oo 
_,eeeo0o0000000C00 
ooeooooocococnoe 
x «KKK KKK KK KK OK OK 
x «MK KKK KKK KOO xX 


x «KKK KKK KK 
xx xKxoOooo0o00ox 


Se MWK KK KK KK 
7 OTM MK KK KK XK OK 


xx KK KK KOK OK 
x KM KKK KKK 
xxx ee eK OOX 
xx xO--0COx 
xxx O- COX 
xx xOCO-O-x 
“XXX OO-OOX 


--* xX xX OCO- COX 


-xXxxOO-0COx 


xx xoOoQ-oox 
xx xoo-00x 
xxxooroox 
xx xoo-0oox 
xx xOoOO-0C0 Ox 
xx xoorocox 
xxxooroox 
xx xooo0r-0x 
xx xoo-00Ox 


NO TNOR AA 
nunnnnnnuw 


w3wr3wr ener we ee 


5 B| CYCLE'|X X 


x «x 
x x< 
xx x< 
xx x 
xx x 
x x x 
x x x 
x << 
x >< x 
xx 
xx 
xx 
<x 
xx 
xx 
x x< 
x x 
x x 
'< 
xx 
x x< 
x x 


xX XXXXXXX xX 


x 


oo 


--=— 


XXXXXXXXKXKXXKXX XK XX KX KIXX 


ww 


x< 
x 
x< 
x< 
x 
x 
o 
o 
x 
x< 
< 
x 
< 
x 
x 
Pad 
x 
~< 
x< 
=< 
x< 
x 
< 
x< 
x< 
x< 
x< 
x 
x< 
x 
x 
x 


oooooo0oo0oo0oo0oo0co0oo0o0°oco 


XX XXXXKXK XX XK KX KX MX KK XIXIXIXIXIO O;X XX XIX XX XXX 


oOxx 
on-r- xx 
~-w ex 
o-xx 
-— KK 
ooxKx 
«x x «Kx 
x «x x 
MK 
x «KK x 
oooo 
oooo 
«x «KK 
oox< 
xx x 
x «x 
<x «x x 
x «x 


xX xXX xX 
x XXX 


x 
x 


XX XXXXXXK KK XeX 
XX XXXXXKX XK XX KK XX XK XK KIX X/ XX 


x 
x 


sus 


BRANCH 
COUNTER 


INPUT/OUTPUT 


SHARED CONTROL FIELD 


FUNCT. 


Am2903 CONTROL FIELD. 


SRC. 


DEST. 


no 
wn 
w 
c 
[o} 
a 
< 
wo 
o 
Ww 
c 
Q 
fo} 
< 
w 
n 
w 
c 
ao 
a 
< 


xOo-- Ox Xx 
x-O0OxXxX 


_. MOCOCOOXKx 
xr OOOoxx 
er RR KX 
xooooxx 
xr erme ex 


xooooxx 
— MR ewe KK 
-xKxoOoOoOoOCcOo 
ooooooo 
xxx xx Ox 


“Sr oo0oeo-- 
Kern KH Kx 
xe eee KX 
xc00oxx 
xe eee xX 
xr KH OX 
Mes Seine 
eoooge0a 
ooaccoce 
cooocooc 
eoooocKoeo 
oxxoooo 
oxxooo0o 
oxxoooo 
oxoooxe 
oxoooxo 
oxoooxo 
ox-90x0 
oxn- OO = 
ox-ooo- 
ox--oo-- 
eK - O- 
xxx xx xO 
xxx xx xO 
xx x KK KO 
xxx xx xO 
oxxooxo 

FF SMe ere Ke 
oxxooxo 
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APPENDIX 


Am2904 CONTROL FIELD 
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APPENDIX B 


AMDOS/29 AMDASM MICRO ASSEMBLER, V1.1 
CPUII DEFINITIONS 


SADVANCE MICRO DEVICES 
A42923 AND AM2904 DEFTNITION FILE FOR CPUIL 


sREV. OCTOSER 17, 1978 


VSM: ecu GeF 
SP2s EIU F#@ 
OFF: EQU Bal 


$2903 DESTINATION MODIFIERS 


ADE: FQU H#@ 
LDR: ZOU Hl 
ADRQ: EQU E#2 
L3RQ: EQU H#3 
HFT: EQU E#4 
LDQP: EFQU F#5 
QPTs EQU 3#6 
KQFT: BQU He? 
AUR: EQU E#e 
TUR: EQU ERO 
AURQ: EQU HHA 
LURQ: EQU PHB 
Y3US: EQU 44C 
LUS: EQU EAD 
SINX: EQU E#E 


+CCKSTANTS 

Re: FQU PHE - 
Pl: EQU R¥1 . ‘ 
R2: EQU E#2 

RS: EQU Fad 

h4: FOU H#4 

E53: . EQU Er5 

R63 EQU E#6 

Rs EQU HA? 

Re: EQU H#S& 

RS: EQU 549 


R16: 1OU HHA 
Riis: EQU EXER 
R12: EQU Eat 
B13: FQU KD 
Elé: EQU FAE 
R15: BOU HH? 


AMDOS/29 AMDASM MICRO ASSEMBLER, V1.1 
CPUIIT DEFINITIONS 


42983 SOURCE MODIFIERS 


RADB: 
FAQ: 
DARE: 
DADE: 
DAQ: 


11/0 


IOIN: 
BIN? 

BOUT: 
LMAR 
YREG: 
AOUTs 
IOUT: 


EQU 
EQU 
EQU 
EQU 
EQU 


SBH2O1 
3B#610 
3BA1008 
SB#101 
3B#110 


L2HA@L 
12E#16 
12H40e8 
12H#19 
12Hae2 
12H#40 
12H#04 


sCARRY SELECT 


ONE: 
CZ: 


7SUB 


SURG: 
SUB1: 
SUR2:s 
SUBS: 
SU84: 
SUBS: 
SUB6: 
SUB7: 
SUPas: 
SUB9: 
SUB1@: 
SUB112 
$U3s12: 
$U313: 
SUBR14: 
SUB15: 
SUB16: 
SUBL7: 
SUB18: 
SUB19: 
SUB283 
SUB21: 


EQU 
EQU 


SUB 
SUB 
SUB 
SUB 
SUB 
SUE 
SUB 
SUB 
SUB 
SUB 
su3 
SUB 
SUB 
SUB 
SUB 
SUB 
SUB 
SUB 
SUB 
su3 
SUB 
SUB 


2BH61 
234108 


SPINITIONS 


S6X,1BA2Z, SVX ,4VX,4VX 
Z6X,1B#2, 4VX,4VX, "AVX, 4VH#F 
Z6X ,1BHO,4VX,4VX , 4%, AVE#F 
3VB#000,16X, 1E#O, 13X 
36X,1B48,12X 

44X,15#0,15X 

44X,1B#Z,15X 

26X 

36X,1BHG,4VX,EX,4VH#F 
36X,1BH#0,4VX,4X, wey 4VEHF. 
36X,1540,4VX, AVX, 4 

24X, 2VPHOG, 34X, abeneoe, 1241,5X 
77X,1B#1, L2VXEHO% 

SPF, 3VBHGG0, 16X,1B#0 ,13% 
24%, 2VE#0G, E4X,4E#0000, 2°#18 
23X,1B40, 6X 

SPF, 3B#000,16X,1VB#O,12x 
54x 

22X,1BH0,7X 

16X,1B#2,13X 

1X,1VB40,14X 

3OX,H#B,2O0X 


3CCU CCNTRCL 
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EEE IEE I 
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AMDOS/29 AMDASM MICRO ASSEMBLER, V1.1 
CPUIT NEFINITIONS 


ACK: DEF 66X,E#9,20X 
OBF: DEF 66X,H#A,ZOX 
CNT: DEF 66X, HAF, 20X 
GRD: DEF 66X,F#@,20X 
J2: DEF SUB11,H#0,SUB20 
CJS: EEF SUB11,E#1,SUP20 
JMAP: DEF SUB11,H#2,SUB20 
CIP: DFF $UB11,H¥3,SUB2@ 
PUSH:  DsF SUB11,B#4,SUP20 
JSPP: DEF SUB11,5#5,SUB20 
CoV: DEF $UB11,E#6, SUB2O 
JRP: DEF SUP11,b47,SUB20 


RFCT: DEF SUB11,H#@,SUB22 
RPCTs DEF SUB11,E#9,SUB20 
CRIN: DEF SUP11,H#A,SUE29 
CJPF: DEF SUB11,H#B,SUB20 
LDCT: DEF SUB11,B#C,SUBZO 
LOCF: TfF SUB11,H#D,SUE29 
CONT: DEF SUBI11,E#E,SUB28 


JP: DSF SUBI1,H#F,SUB20 
JSP: TE? S$Ub14,H#61,SU520 
RTN: DE? SUB14,H#GA,SUB28 


sSHARED CONTEOL FIELD 

GOTC: BEF SUB12 

COUNT: DEF SUB12 

PUT: DEF 77X,1B48,1ZVXH#OX 
;POLARITY CONTRCL 


DEF 65X,1B#1,24X 
DEF €5X,1B4#0,24X 


tone 


32923 CONTROL/FUNCTIONS 


Ins DFF 36X,1341,AeF, OX Aer, H#O,19X .1B#8,13X 
NUT: TEE 262,1240,EX,H#F BHC, HHE, SUES 
YORF: DEF 36X,1B#1,53% 

EIsH: DEF SUBS ,H#Z,334912,SUB19 

SRS; DeF SUB1,A#1,SUB2 

SSR: DEF SUB1,H#2,SU33 

ADD: DEF SUB1,H43,SUBE 

PAS: DEF SUB2,H#4,SURZ 

cOMS : DEF SUB2,E#5,SUBS 

PAR: DEY SU89,H#6,SUB3 

CCVR: DEF SUEO,H#7,SUBzZ 

LOW: DEF SUBS ,5#E€,3X,SUB19 

CUAS: DEF SUB1L,H#9,SUBS 

XNSS3 Ter SUEL,H#A,SUBZ 

XOP: DEP SUB1,B#3,SUB3 

AND: DEF SUB1 ,H#C,SUBS 

NOR: DEF SUP1,E#r,SuUBe 

NANT: DEF SUB1,H#=,SUBS 

CRs DEF SUBi,##F,SUSS3 


$2S¢2 SPECIAL FUNCTIONS 


AMDOS/Z9 AMDASM MICRO ASSEMBLER, V1. i 
CPUIIT DEFINITIONS 


UMUL: 
. TCMs 
SMTC: 
TCMC: 
SLNs 
DLN: 
TDIV: 
TDC: 
INC? 
SDQP: 
SUQP: 
LQPT: 
RMOV: 
QMOV: 
SDRL: 
SURL: 


52904 


SDDE: 
SOUR: 
SDDL: 
SDUL: 
EDD: 
RDU: 
SSXO: 
RSD: 
RSU: 
SUL: 
SU: 
SDL: 
SD#: 
SDMSs 
SMS: 
SDDC: 
SDUC: 


32984 
RSTLs 


SWAP: 
SHLD: 


12984 
LMA: 


RSTA: 
SEOLD 


» 2904 


DEF 
DEF 
DEF 
DEF 


SUBO ,H#S,SUB16 

SUB ,H#2 ,SUB16 
SUB10,E45,SUB16 

SUBS ,H46,SUB16 

SUB1@ ,H#&,SUB16 

SUB ,H#A,SUB16 

SUBO,H#C ,SUB1E 

SUBO ,H#E,SUB16 
SUB10,H#4,SUP16 
SUB4,B4#5,4X%,SUBS 

SUB4 ,E#D,4X,SUBS 

S6X, 1B#, 8X, 4VX,H#E6, HAG, SUES 
SUB2, q#4, SUBS 

S6r, 1Ba, 4vVX, ey ,MEM,H#4,3BH010,SUBI9 
SUB1O , HAL, H#4, SUBS 
SUB18,H#9,H#4,SUBS3 


CONTROL 


SUB? ,H#3, SUBE 
SUB? ,H#?7,SUBS 


* SUB7 , HHE,SUEG 


SUB? ,H#6,SUB5 
SUB? ,H#F,SUBE 
SUB? ,H#F,SUBS 
SUB? ,H#2,SUB6 
SUB? ,H#A,SUBG6 
SUB? ,H#A,SUBS 
SUB? ,H#2,SUB5 
SUB? ,H#3,SUB5 
SUB? ,H#@ ,SUBG 
SUB? ,H#1,SUB6 
SUB? ,H#5,SUBE 
SUE? ,H#2,SUB6 
SUB? ,H#?,SUBG6 
SUB? ,H#4,SUB5 


INSTRUCTION CODES 
3X, 6B4#000211,SUB17 


3 X,6B#0@0012,SUB17 
1B#1 


MACEINE INSTRUCTION CODES 


DEF 
DEF 
: ODFF 


MICRO 


SUE15, GBA#200008,SUB17 
SUE15 ,6B#0@0711,SUF17 
23X,1B#0,66X 


STATUS SELECT 
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CPULI DEFINITICNS 


MIZ: DEF $UB18,63#010100,SUB21 
MIO: -DEF_-§UB16,6B#010110, SUB21 
MIC: DEF SUB1S,6B#011010, SUB21 
MIS: DEF SUB18,654011110,SUB21 
32904 MACHINE STATUS SELECT 

MAZ: DEF SUB18,6B4100100,SUB21 
MAO: DEF SUB18,6B#180110,SUB21 
MAC: DEF SUB1S,6B3#101010,SUB21 
MAS: DEF SUB18,6B#101110,SUB21 
;DEVICE DISABLE 

ALUCFF: DEF 7 X,1B#1,13% 

ALLOPF: LEF 7 X,3B#111,13% 

SLOAD CONSTANT 

CONST: DEF 16 VIH#O%,42,12#0,69X 

SBCD STATUS RZGISTER CONTROL 

ENR: DEF 16X,1B@,73X 

CLSR2: DEF 17X,1BA0,72X 


ENSRi: DEF 18X,1B41,71X 
CZtRO: DEF 19X,15#0,70% 


END 
‘TCTAL PHASE 1 BRRORS = @ 


aMDOS/29 AMDASM MICRO ASSEMBLER, V1i.t 


sALVANCE MICnOQ TEVICES 
3 AMZSZ3 AND AM2S04 CPUII SOURCE FILE ~ 


120 ORS Fx120 
@12¢ INP: ALUOFF & T & CBF & CJP S SCTO IKE’ 
8161 ALUOGFF & PUSH : 
2122 IN & T & OBF & LOOP & PUT ICIN 
3183 ALUCFF & RTN 
6104 CUTP: CUT & CONT & (PUT YREG 
2105 _ALUOFF & PUS! 
B1LE ALUCFF & F & riGE & LOOP & PUT IOUT 
6187 ALUCFF & PUSR : 
2126 ALUCFF & T & ACX & LOOP 
2105 ALUOFF & RTN 
@16A USM: .. LOW R1 & JSR & GOTO IAP 
2105 PAK R@,R15 & JSR & GOTO INP 
212C LOPT R15 & F & GRD & PUSH & COUNT 205 
21D UMUL R1,R1,k3 & F & CNT & SDDL & RFCT 
G12E PAR K15,R1 & JSR & GOTO OUTP 
C1LOF QMOV R15 & JSR S& GOTO OUTP 
2114 JP & GCTO USM 
€111 SM: LO#® ki & JSk & GOTO IXP 
£112 PAR RG,Ri5 & JSR & GOTO INP 
@112 - LQPT Rid & F & GRD & PUSH & COUNT 20D 
€114 TCM R1,R1,k2 & F & CNT & SDIL & RECT 
@115 TCMC Ri, Rl, R@ & SDDL & CCNT CZ 
2116 . PAR R15,R1 & JSR & GOTO OUTP 
€117 QMOoV R15 & JSR & SOTO OUTP 
21128 ALUCFF & JP & GCTO SM. 
211S DIV: LOW R1@ & JSR & GOTO INP 
@114 PAR K7,K15 & JSR & GOTO IAP 
211B ° PAR R1,R15 & JSR & SOTO INP 
@11C PAR R4,R15 & CONT 
-@11D LOOP1: FAR R3,R7 & CONT 
@11E PAR R2,R1-& T & MIZ & CJP & GOTO. ABCRT 
@11F SMTC R2,R2 & CONT CZ 
2120 SMTC R3, RS & fT & MIO § CJP CZ & GOTO SCALE1 
2121 ALUOFF & T & MIO & CJP & GOTO SKIPE 
9122 SURL R3,R3 & SUL & CONT 
0123 SURL R2,R2 & SUL & CONT 
2124 ALUCFF & JP & GCTO LCOP2 
@12= SCALE1: LQPT R4 & JSR & GOTO SDIVD 
2126 ALUOFF & JP LCOP! 


@127 LOCP2: SSR R15,RS,R2,YBUS & CONT ONE 
@128 SKIPE: LQPT R4 & F & Mic & CJP & GOTO SKIPS 


@129 ALUOFF & JSR & SOTO SDIVD 

W12A SDRL R2,R2 & SDL & CONT 

€12B ALUOFF & JP & GOTO LOOP2 

@12C SKIPS: ALUOFF & F & GRD & LDCT & COUNT Z22C 
@12D DLN R1,R1,R7 & T & GRD & RDU & PUSH 
@12E “TDIV R1i,R1,R? & F & CNT & RDU & HFCT CZ 
@12F TDC R1,R1,R7 & SUH & CONT CZ 

8130 QMOV R15 & JSR & GOTO OUTP 
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2121 PAR R15,R1 & JSR & GOTO OUTP. 

61c2 ALUOFF & JP & GOTC DIV 

©133 SDIVD: PAR R1,R1 & CONT 

2134 ALUOFF & T & MIS & CJP & GOTO KEG 

€125 PAR R1,R1,A4DRQ S&S SDDL & CONT 

Y13E ALUOFF & JP & GOTO RET 

2137 NES: PAR R1,R1,ADRQ & SDIL & CONT 

€1ze RET: - QMOV R4 & COAT 

¥13¢ PAR R13,R1d & RTN ONE 

2134 SLNORM: JSR & GOTO INP 

2123 LQPT R15 & CCAT. 

213C SLN R2,n2,CFF & CCNT & SHOCLD 

@13D MAZ & T & CIP & .GOTO ABOFT - 

@13z MAC & T & LOW R@ S&S CJP & GOTO END 

d12F SLN k2,R2 & MAC & T & CJP OKE & GOTC END & SUI. 
2142 AGAIN: SIN k2,k2 & MIO & F & CUP ONF & GOTO AGAIN & SUL 
@141 SDQIP & SMS & COKT 

914 SRS R2,R2,RO & CCNT 

2142 QMOV k15 & JSR & SOTO OUTP 

@144 PAR R15,R2 & JSR & GOTS OUTP 


0145 EMD: @P & GCTC SLNGRM 


214€ LLKGRM: JSR & SOTO INF 

0167 LQPT R15 & JSR & GOTC INP 

#142 DLN R15,R15,R15,CFF & CONT & SHOLD 

14S MAZ & T & CJP & SOTO ALCHT 

14a LOW R2 & MAC & T S& CIP & SCTO END2Z 

v14E DLN R15,R15,R1S5 & SDUL & MAO & T & CJF & GOTC JUMP1 
Z214C LOCP4: TIN R15,k15,n15 & STUL & MIO & 7 & CJE & SOTO JUMP 
e144 PAA R2,R2 & JP CNE & SOTO LOOPS 

214k SUMP1l: PAR R2,Ré & CCNT OKE 

€14F STdQ R1E,R1S & SDMS & JSR & BUTC OUTP 

219¢ QMOV R15 & JSR §& GOTC OUTP 


W151 EkDe2: JP & GCTC DLAOKM 
“152 SQ rT: 10? Rid & CONT 


€13¢ LOe RZ & JSR & GOTC INP 

0154 PAa H1,r15 & CONT 

Z15= PAK h2,H0,,TARB & CONST 38@5 & CONT 
€15€ FAR 25,R@,,DARB & CONST 2403 & CONT 
e157 FAX t4,80,,DAR? & CCNST PSBEFF & COAT 
W15E PAR KE,k2,,LAKB6 & CONST 4262 & COKT 
@15¢ FAR RE,RO,,DARB & CONST J3¥28 & CCXT 
e1dh SnS R2,R1,RS & CCNT & SHCLD 

€15B CYCLE: aND R5,R5,R4 & CONT 

t1oc SDRL 14,84 & MAS & CJP & GOTO ENDZ 
215i SUdI 4#@,ke0 & T & MAS & CUP & SOTO POS 
W153 GR R5,RS S&S JP & SCTO CAT 


415r FCS: CR R5,R2 & COKT 
21€E2 CAT: Sx35 FE,RE,R1IB & CONT 


121 SERL Kk2,R2 8 T & HIZ & CIP ,SHLD & SCTO ENDS 
£162 SDAL kS,RE 4 T & MAS §& CJP & GOTC SUE 
clés AIL A2,k8,R5 & JP & wOTO CYCLE §& SnOLL 


é15~ SUB: SRS R@,Al2,25 & JP S VOTG CYCLE & SHOUD 
135 rAd JP & GCTC SORT 
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G1i6€ ABChT: ALUOFF & JP & GCTO ABORT 
2167 JF & GOTO DIV 


END 
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#102 
8181 
8182 
818s 
2104 
8105 
212é 
2187 
6126 
0105 
C1CA 
@132 
£12@C 
0120 
O18E 
212F 
211 
2111 
@1lle2 
é11¢ 
9114 
2119 
g11€ 
0117 
wile 
€119 
@11A 
Z11F 
211¢ 


XXXXXXAXXAXXXXXX 
1110100011X0118¢ 
XXXAXXXXXAXXXXXX 
UXLXXAA1LGEXSAAXX 
XXXXXXXXXXXXXXXX 
1112@161131Xeea¢e 
XXXXXAAXXAXXXXXX 
18882604019 X21XXX 
AXXXXXXXXXAXXXXX 
IXXXXA1110Xs20ce 
XXXXXXXXXXAXXXXAX 
UXXXXXOG1IGOXG1XXX 
XXXXXAXAXXAXAXXXX 
1816811101 421008 
XXXXXXXXAXAXXXXX 
UXXXXXCIBOXC1IXXY 
RXXAXAXXAAXAAXXAX 
11188121@1X@1X7X 
XXXAXXAXAXXAXAKAX 
1806001010 X81 XXX 
NXXXAXXAAXXXXXXX 
1220c00021X001¢0 
AXXAXXXXXXAXXXXX 
1GELL2GLO1KEB100 
FXXXXAXXAXAXAXAAX 
1204600128X20120 
XXXXXAXXXXAXXXAX 
10111 1162O2O0XXX 
XXXXXXXXXXXXXXXX 
1262@22201402103 
XXXAXAAXXXXXAXXXK 
12820800301X22120 
AXXXXXXXXAXAXA KAY 
LY XXXX1111X3X12ee 
XXXXXXXXAXAXXXXX 
1€@2320201XG2120 
EXXXXXXXXXANXXXXK 
1200080621 4801608 
AXAEXXXXXAKAXXXX 
1228082128 X20180 
XXXXXXXXXXXAXXXXX 


1211111002822XXX 


AXXXXAAAAXAARAAAX 
LXXXXX11103dIXXX 
RAXXXIXYAXXXXXAXX 
12368280201 X2210¢ 
AXXXXXXXXXXAXXXX 
12220¢2221X08122 
AXXXXAYXXKAXXXAX 
1XXXXX1111X81144 
XX XXX AXXXXAAXXXX 
122 202C071X22100 
XXXXXAXXXX AX AKXX 
1@2@vC0201X20180 
AXAAXAXAYAAAAAXXK 
1924000001 X00123 
XXXXXAAXAXXXAALX 
AXXXXX111 Sx HKAX 


XXXAKXXXGLSXXXRAX 
81262220008 
XXXXAXFKXAS|SXXXXXX 
XXXAXAXXAX 
XXXXXXXXGSXXXXXX 
2802000001 
XXXXXEXYSEOXXXXXAX 
XXXXXXXXXX 
XXXXXXXXOLXXXXXX 
GOZ28SEC19 
XXXXXXXXGAXAXXXX 
XXXXXXXXXX 
XXXXXXXXOOXXXXXX 
3890032138 — 
XXXXXXXY@OXXXXXX 
XXXXXXXAXX 
XXXXXXXXSOXXXXXX 
XXXXXXXXXX 
XXAXAXXXXGOXAXXXXX 
XXXXXXXXXX 
XXXXAXXX OLAX XXXX 
C1LE LeeLee 
XXXXXXXXIOXAAXAK 
G1C20ebe72 
XXXXXXAK DCE AXXXX 
02629011128 
XXXXXXXXO9G2112XX 
XXXXXXXXXX 
XXXXXXXXSOXXXXXX 
Z1LHO2L1ES 
XXXXAXXXAXSOXXAXAX 
8106203190 
XXXXXXXXODX AXXXXX 
0122221818 
XXXXXXXXBOXXXXXX 
Z122L07800 
XXXXXXXXSBXXXXAXX 
0130000222 
XXXXXXXXIBLXXXXXX 
BO22C21121 
XXXXXXXXIIB114XX 
XXAXXXXXXX 
XXXXXXXF100119XX 
XXXXXXXXXX 
XXXXXX AA SOXXXAXX 
@186002100 
XXXXXXXXIDOXXXXAZ 
91229221008 
XXXXYAXXXSOXXXXYX 
2108216€01 
XXXXYXXXS|XAXXAXX 
B1C2S08eeH 
XXXXXXXXSBXAXXIX 
01860000022 
XXXXXXXACOXAXKXX 
91329333202 
XXXXXXAX VLA XXXXX 
XAXAXXAXY AX 


XXXXXXXXXXXXXXXX 
YXXXAXAXXXXXAXAX 
XXXXLLILAXXXXXXX 
XXXXXAXXXXAXXAXX 
XXXXEXXXXXXXX111 
EXXXXXXXXXAXXAAX 
AXXXXXXXXX XXXXXYX 
XXXXXXXXXXAXXXXX 
XXXXXXXAXX XAXXXX 
XXAXAXAXXX AAXX LX 
XXXXOCV}SLX XXXXXX 
XXXXBIOBOXALX111 
XXXXSXAXXXXXKL11 
XXXXZ22910201200 
XXXXQL1LAXXXXZOI 
XXXXB1LLAXAXXXXX 
XXXXXXXNXKZXXXXX 
XXXXZSRIAXXXAXAX 
XXXXS2OSLXXXX111 
XXXXOXXXXXAXX111 
YAXX29SS10301204 
XXXX000E126219e 
YXXXB1ILLIXIXX2O™ 
X¥XXB1LLAIXXXXKXX - 
XXXXXXXXXAXXXXXX 
XXXXO101 QAXXAXXX 
YXXXOOLLIXXNX111 
XXXXGZECI XXXL 11. 
XXXXO212EXXXK111 


‘ 


XXAXXXXXXAXXLOWB 
XXX LXXKAXAXX GIO 
X11112003xxxea00 
XXXXXXXAXXXXSLOS 
1112¥21189e2¢20e 
XXXXXNAXXAXNZGWS 
XXXXXXEAKAXXLLLE 
XXXRAXAXXXKXVVO 
XXXXXXXXXXXXOGIZ 
XXAXXKXLXXXK LOSS 
X111110@OXXX2ee0 
11111211¢8200202 
1311¢¢11¢e0eeC02 
@GALSezELeseeo2ze 
11111¢1132002002 
X111181800122020 
XXXXXXXAXXXXLOLO - 
X11111890XXX2204 
11111011¢200200e 
1211¢@1192¢¢¢229 
0221280220202000 
abi1edeneoe2cao0 
11111911220eec02 
X111121280128202 
XXXXXAXXAAXXKXGSWO 
X11111229XXX2000 
111119118282320 
111116119292c00¢ 
11111¢119e¢oeee2 
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0115 
211¥ 
@11F 
0126 
8121 
#122 
2123 
0124 
@125 
012€ 
0127 
@12& 
612S 
@12A 
2122 
012C 
@12E 
@12E 
Ol2r 


@15¢ 


XXXXXXXXXXXXXXXX 
UXXXXX1TLIBXOZAXX 
AXXXXXXXXXAAARXXX 
1110110011X230133 
YXXXXXAXAXAAXXAXX 
LXXXXRA1LLIEXACXXX 
XXXXXXXXAXAXXXAXXXX 
111@110011X22108 
AXXXXAXAXXAAXAXXX 
1116112811X41126 
XXXEAAXAXXXAXAXXX 
AUXXXXX111G2B2XXX 
AXXXXXXXXXXKXXXXX 
UXXXXX111Z922X xX 
XXXXXAXXAXXAXAXXX 
1XXXXX1111XG1196 
AXXXXAXAXXXXXAXXXX 
18092000021X88108 
XXXXXXXXXXXXXXXX 
1XXXXX1111XO1XXX 
AXXXXXXXXXAXKAXX 
1XXXXX111OXIOXXX 
XXXXXXXXXAXXXXXXX 
1212112011X00168 
XXXXXXXXXXAXXXXXX 
12622000601 401100 
XXXXXAXAXXXAXAAXXX 
1XXXXX111G3OdXXX 
XXXXXXAXXXKXXXXXX 
1XXXXX1111xX¢1120 
XXXXXXXXXXAXXXXX 
1602@001100X2119@ 
XXXXXXXAAKX EX AXAXXX 
1168000100305 XXX 
XXXXXXAXXXAAXXKX 


+ 1211111220 400XX¥ 


XXXAXAXAXAXXAXXXXX 
AUXXXXX1L1CBOSXXX 
XAXARAXXXAXAXAAXX 
16220002601 K091608 
XXXXXXXXXXAAXAXNX 
1020800001 XG0180 
XXXXXXXXXXXXXXXX 
1XXXXX1111XG11e~% 
RAXAXXXXXXXXXXXAX 
LXXXXX1119 Xoo XXX 
XXXXXXXXXXXXXXXX 
1110110011X@1126 
XXXXXXXXXXXXXXXX 
AXXXXX11120SSXXKX 
YXXXXAXFXAXK XXXXXX 
1XXXXX1111X061106 
XXXXXXAXXXAXXXXX 
LXXXXX1119GE0XXX 
XXXXXXXXXXXXXXXX 
UXXXXX111OXESXXX 
XXXXXXXXXXXXXXXX 
1800801610 X2OXxXX 


XXXXXXAXXDSXAXXXX 
XXXXXXXXXX 
XXXXXELKLSXAXKSL 
21811301138 
XXXXXAXK1 SX XXXXX 
XXXXXXXXXX 
XXXXXX@X1IXXXXZ1 
@199129101 
XAXXAXSKGEXAAXZT1 
3124121008 
XXXXXXXXGOLOG1IGXX 
XXXEXXXXXX 
XXXXXXXXVOBd19XX 
XXXXXXXXXX 
XXXXXXXXOOXXXXXX 
6100108111 
XXXXXXXXOEXXXXXX 
120112613 
XXXXXXXXO1LXXXXXX 
XXXXXAXXXX 
XXXXXXXXOLXXXXXX 
XXXXXXXXXX 
XXXXXXEXCOXXXXO1 
8188101126 
XXXXXXXXDSXXXXXX 
@120118011 
XXXXXXXXOSEGSOXX 
XXXXXXXXXX 
XXXXXXXXOOXXXXXX 
@1¢@10€111 
XXXXXXXXOOXXXXXX 
O200221126 
XXXXXXXXO2Z1111XX 
XXXXXXXXXX 


XXXXXX¥X1G1111XX 


XXXXXXXXAX 
XXXXXXXX140011XX 
XXXXXXXXXX 
XXXXXXXXGPXXLXXX 
01092980120 
XXXXXXXXOOXXXAXX 
2182280129 
XXXXXAXXXOOXXXAXX 
0108911081 
XXXXXXYXOGOX AXAXX 
XXXXXXXXXX 
XXKXXXOXESXXXXS1 
@10011¢111 
XXXXXXXXBOO11OXX 
XXAXXXAXXXX 
XXXXXXXXOOXXXXXX 
218028111064 
XXXXXXXXOO211OXX 
XXXXXXXXXX 
XXXXXXXXO@OXXXXXX 
XXXXXXXXXX 


XXXXXXXXOIXXXXXX 


XXXXXXXXXX 


XXXXBSSOLIXXAXZ1A1 
@1020UG1eXXXxeoe 
XXXXSZO1@S1EXXX 
4110446118411XXE 
@11GXXXXXXXAXXXX 
XXXXOSZ11AZ1A1XXX 
XXXXBOGI1LOSB1ZXXX 
XXXXXAXXXXAXXAXX 
XXXXE@XXXXXXXXZ1B 
XXXXXXXXXXXXXXXX 
XXXXO11112011001 
121G@XXXXXXXXG1B 
UXXAXXXXXXXAXXXA 
XXXX@OG1LSESG1SX XX 
XXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXX 
XXXXOULG100C1Z211 
XXXX92@¢10201011 
XXXX2V0910001011 
XXXXOL1L1L1LXXXXXXX 
XXXXB1111XXXXGOO 
XXXXXXXAXXAXXXXX 
XXXXOOLO1LXXXXGOS 
111OXXXXXXXXXXXX 
XXXX3OOG1XXXKBOO 


XXXXXXXXXXAXXXXX 


XXXXOSOU1XXXXBOS 
XXXXOSLIBSXAXXXKXX 
XXXXO101O0XXXX161 


111110110222¢¢80 
1111121100020020 
X912199202000002 
X210190200020222 
XXXYXXXXXXXXCIOS 
X1¢21012000¢e@020 
X100101202200242 
AXXAXXXAXXXXGVW™ 
@e11¢¢119¢e00000 
XXX XXXXXXXXXGSIO 
011002¢190092000 
9011221122000000 
XXXXXXXXXXXXZGVI 
XO22191¢2e0eeEG0 
XXXXXXXXXXXXCLVS 
XXXXXXXXXXXXGIOE 
11¢1eseesogo20ce 
11199ec¢eeeaeoeoo 
1111822200un200 
X1111¢1¢0c¢1eg20e 
11111011090¢0002 
XXXXAXXXXXXXGG*E 
11111011¢¢0¢a2e0 
XXXXXXXXXAXXLGOO 
1291¥81120000020 
XXXXXXXXXXXXSOSW 
10012011¢20002000 
X111101020199020 
8111101192020002 
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XXAXAXXXAXAAAXXA 
LLCOVSCL71 KCX123 
XAXXXXKXXXAAXXAX 
UXAXXKLLIGFOOXXA 
XARAXAAKXKARKAKN 
UXXXXX1112XS1 XXX 


» XXXXXXAXAXXXXAXX 


111211@¢11%2xX12e 
XXXXXXXXXXAXXXXX 
1110112211%@0122 
EXIXAYXAAXXAYXXXX 
1110118011040108 
XXXXXXXAXXXXXAXXX 
1218110211¢2021¢¢ 
RAK AXAXAXKAXXKYY 
UXXXXA1112CE2XXX 
VAKEXAAXAAXAXXNXY 
LXXXXK111ZBXSSXXX 
EXXYXXXXAXXXAXAXX 
1222UCE0Z1X2C10E 
XXXXXXXXXXANXXXX 
1Z22€22eee1A1eC1ive 


D AXXXXAAXARAXKXAX 


iXXXXX111148X124 
EXXXXXXAXNAXXXXX 
1vdOULE001 YOX10e 
XAXAXXAXAXXAAXXXX 
12€0622221X90120 
XFXXXXAAXXANXXXXX 
LXXXXX111O0XDi XXX 
XAXXXXXAXXAXXXXX 
1112112211 X@X16C 
AXAXXXXXXXAXXXXX 
111211€21142¢122 
EXXXXXXXNAXAXAXA 
11161160110021¢2 
XXXXXXXXXXAKXAXXX 
1119110011¢ev1¢@ 
XXXXXAXXXXAAAXXX 
4XXXXX1111Xe910Aa 
AXXXXAXXXXARXXAX 
1XXXXX1115XoO9XXX 
AXXXAKAXXXXAXXXX 
1@0ee¢ceziceele?r 
AXAAXAXAXALAAXXXX 
1402022081 X021¢23 
DEXXXXXXXK AXXXXX 
1XXXXX1111XEX1¢0 
XXXXXXXXXXXXAXXXX 
LXXXXX11LIBAW LY XXX 
EXXAXAXXAXAXXXXX 
1902020601 X00120 
XXXXAAKXXAXXXXXX 
UXXXXX111SXBIXXX 
LLOCLOCVOOILO161 
AXXANXXLLLS KE VXXX 
QOALOSOB00 404011 
UXXXXX111OXCOXXX 


XXXAXXXX@SXAXXXX 
#1Z22eLBee2 
XXXXXXXXSIAXXXXXX 
XKXXXXAAAX 
XAXXXXXXESSXNAKXX 
XXXXXXAXXX 
XXAXXXXSXLSSXXXX1A 
2121100118 
XEXXXX2XASXXXX1¢2 
@121@221@1 
KAXXXXCXC1EL1212 
9191304131 
XXXXXXOXB1G21021 
2121220822 
AXEXXXYXOVDI1IBAY 
XAXKXXXAXX 
XXXXXXXXOLAXXAXXX 
XXXXXXAAXX 
XXXXXXXXCVSAXAAXXK 
@162 C0128 
AXXXXXXXSOXAAXAX 
912¢€¢02122 
XXXXXXXYEOXXXAXX 
128111219 
XXXXXAXAXXCOXXXXXX 
C1SSACZESS 
XXXXXXXXOOXXXXAX 
C1LLCOLOZ2O 
XXXXXXXOLLSXAXXXXX 
XXXXXXXXXX 
XXXXXXOXSLXXXX1E 
1801102119 
KXXXXXOXOOXXXX14 
2101212021 
XXXXXXOXVL211018 
4161821119 
XXXXXXEXO2L11001 
@121¢21114 
XXXXAXKYDAXAXXXX 
@121221122 
XXXXXXXXE1LX AXXXX 
XXXXXXXXXX 
XXXXXXXXZBC1B1XX 
0166282100 
XXXXXXXXCOXXXXXX 
218¢8e21ee 
XAXXXXXXXOOXXXXXX 
0101080118 
XXXXXXXXSOXXXXXX 
XXXXXXXXXX 
XXXAXXXXOGAXXXXXX 
B1ACLT7CO2O 
XXXXXXXXOOXXXXXX 
XXXXXXXXXX 
XXXXOXXXGOXXXXXX 
XXXXXXXXXX 
XXXXBXXXBOXXXXXX 
XXXXXXXXXX 


XXXXAXAXXXAXXXXX 
AXXXIAAXXXXANALA 
XYXX2VALSTEALKXX 
B1VOXXXXEX XXXXXX 
1d13030OGXXXXXKX 
@11G0201E2G1C¥ XX 
C11@VWU1SeESE1exxxX 
XXEKSXAXXXAXXXXX 
XXXXCCC1SLNE ALAA 
XYXX@1111X AXXAXX 
XXXXI1111XXAXDA1 
XXXXXXAXAXXXKXAXX 
AXKAXAXXKX AXA AX 
XXXXIKAXXAXXXL1A 
XXXX@11111111111 
O1LSEX XXXXX AXXAXX 
141 3G9219XXXXXXX 
@11@011111111111 
@118211111111111 
XXXXSVDLOXXXXSO1 
XXXXJOCBXXXXOG1 
XXXXO11111111XXX 
XXXX2111 1AXXXXXX 
XXXXXXAXXXXXXXXX 
XXXX@1B1OXXXXXXX 
XXXXSLBLUX AXXXXXX 
XXXX@@GO1XXXX111 


XXXX2QG1GXXXXBOL 


XXXXBIO1L1AXAXXVBS 


XXXXXXXXAXXK2H2WC 
1411801104290228 
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XXXAXXAXXAXXSLE 
X11111029XXX2vB2 
Xleeeceeazeecces 
X1¢ecoeoeesorcoe 
X@1ZAXXXXPLELLEZ 
@11118Z221922¢0e2 
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XXNEXXXXXXXX2ZOZ 
XXXXXXXXXXXXGSOL 
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XXXXXXXXXXXXCCRB 
X11111999XXX 2020 
11¢1¢e@veceserece 
11¢18¢2eeceoeare 
$11119112220e¢90 
0111101190z28e2028 
X0V1181000282000 
X1111¢1202109200e 
XXXXXXXXXXXXLCL 
X111110@2XXxXC009 
X111110G0XXxX900e 
111112110¢002000 
@111101101090088 
111191121829290 
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1211111111111111 
LXKAXAL1IIL AC SKXX 
CISCLRC ACL SOL cea 
INXXLKLLIORXGSXAX 
€00I9CIdT6001I23 
UXAEXK11127@2XXX 
oe XEAAKXAXXAXXXXL 
PXXXXX1110XGOKXX 
LXYEXXAYAEXNXYXXXX 
UXXXZA111 EXC CKXX 
XXKXXXAXXXAXXXXX 
1X1¢11¢2114¢Z2100 
AXVALPIIX AT AYXNE 
1114112311 4312e6 
LAXNAXXXANAAXYY 
1XX¥XXX1111XG2100 


SF XXXXXXXXXXAXALXX 


AXXXXX11ISASEXXX 
AXAXXAXKAXKXXKXX 
TXXXXX1113KOdXXX 
RXXXXXX7XVAXXXXX 
1110112¢0114161¢¢ 
XXXXXXXXXXAXXXXX 
1118112@11X20122 
AXXXKXXXYX AXXXIX 
1XXXXK1111X63100 
XXXXXAXXXXAAXXXX 
1XXYXX1111 420123 
AXXXXXXXXXAXXXXX 
1¥XXXX1111X%04122 
XXXXXAXAXAXXKXXAXX 


»1XXXXX1111X%4118e 


UXKXXXXXAXXAXXXXX 
1XXXXX1111XBX180 


AXXXCIAXY OVXXXXXY 
XXXXXXXAXX 
XEXXEXXXASXXXAXXX 
XXAXXXYAXX 
XXXXSXXXGAXXXXXX 
XXXXXXXXXX 
XXXXXXF ALOXXYXXX 
AXXXXAXXXXX 
XXXXAXXAXDPEXAXXXX 
XAXXXAXXXX 
XXXXXXIX2OXXXX14 
€1211¢2101 
AXXXXXGRCSXXAX1ID 
41913811111 
AXXXXXXXSOXXXXXX 
0121130226 
XXXXXXXXDOXXXXXX 
XXXXXXXXXX 
XXXXXXXXOASXXXXXX 
XXXXXXXXXX 
XXXXXXCXOLXXXXL1 
©1¢11001¢1 
XXXXXXOKBOXXXX19 
@121182102 
XXXXXXXOLOXAXXAX 
21818011811 
XXXXXXX@ASXXXXXX 
@141211211 
XXXXXXXXGOXXAXXX 
2101210218 
XXXXXXXXSOXXXXXX 
012110€11¢2 
XXXXXXXAXGSAXXXXX 
0108011001 


IAXXESLZERXIASSL 
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$111121121029220 
11111¢¢¢1920¢00¢ 
01111119¢0ee7o00 
X800121002020002 
X12¢101¢9e00¢0c0 
X11111111920¢020 
K111111110020228 
01111¢¢01 0000208 
Xo@e1e1s0e0ecoca 
X3021v1200020200 
11111¢¢110¢02¢0¢0 
1111120¢19920008 
AXXAXKXXXXXXGIG9 
XXXAXXXXXXXXZOVO 
XXXXXXXXXAXX2ESE 
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Am2903 MNEMONICS 


ly FUNCTION 


RAM B — OUTPUT 
Q REGISTER 
SPECIAL FUNCTIONS 


ALU Functions | 


Special Functions 
Fi = HIGH 
Subtract R from S 
Subtract S from R 
Add R and S 
Pass S 
2's Complement S 
"y Pass R 
sy 2's Complement R 
His, Fi = LOW 
“eS Complement R AND with S 
Exclusive NOR R with S 
Exclusive OR R with S 
AND R with S 
NOR R with S 
NAND R with S 
OR RwithS 


ALU Destination Control 


Arithmetic Shift Down, Results Into RAM 
Logical Shift Down, Results into RAM 
Arithmetic Shift Down, Results Into RAM and Q Register ~ 
Logical Shift Down, Results Into RAM and Q Register 
Results Into RAM, Generate Parity 
Logica! Shift Down Contents of Q Register, Generate Parity 
Resuits Into Q Register, Generate Parity 
Results Into RAM and Q Register, Generate Parity 
Arithmetic Shift Up, Results Into RAM 
Logica! Shitt Up, Results Into RAM 
Arithmetic Shitt Up, Results Into RAM and Q Register 
Arithmetic Shift Up, Results Into RAM and Q Register 
Results to Y BUS Only 
Logical Shift Up the Contents of the Q Register 
Sign Extend : , 

’ Results to RAM, Sign Extend 


Special Functjons 


Unsigned Multiply 

Two's Complement Multiply 

Increment by One or Two 

Sign Magnitude «—> Two's Complement 
Two's Complement Multiply Last Step 
Single Length Normalize 

Double Length Normalize 

Two's Complement Multiply Division 

Two Complement Division Correction 


“- 


Am23904 Mnemonics 


INSTRUCTIONS 


=~ «= 


Microstatus Register Instruction Codes 


RSTI Reset zSR ~ iy O > py 
SWAP Register Swap * Mx > Hx 
SHLD Hoid Status f 


{ 
i 


Machine Status Reglster Instruction Codes 


LMA Load Yz, Yc, Yn. Yova Yy > My 
To MSR : 
RSTA . Reset MSR 


SHOLD Hold Status 


O > My 


SHIFT 


$10, 
12 

In © lover 
 QI0, 


Microregister Condition Code Output (CT) 


MIZ Zero wz > Cr 
MIO Overflow Hova > Cr 
MIC Carry wo (> Cr 
MIS Sign Hn o> Cr 


Machine Register Condition Code Output (CT) 


MAZ Zero Mz — Cr 
MAO Overflow Movra -> Cr 
MAC . Cany Mc > Cr 
MAS Sign My > Cr 
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Chapter V 
Program Control Unit 


Pe 


Introduction 


In order to access Instructions and data In an orderly manner 
within a computer, a Program Control Unit is usually used to 
provide the most efficient mechanism for program control. A 
program is a set of instructions which direct the processor to 
perform a@ specitic task. Ordinarily, program instructions are 
stored in sequential memory locations. During the normal pro- 
cessing of a program, an instruction is fetched from the location 
Specified by the program counter, the instruction Is executed, the 
program counter is incremented, and another fetch and execute 
cycle begins. The addressing mechanisms that such control unit 
might employ are various. Indeed there are some machines that 
literally use dozens of addressing modes to fetch instructions and 
data. in this discussion of program control units, several of the 
addressing modes and their commanimplementation techniques 
will be discussed. The addressing modes used commonly in 
today’s machines include register, immediate, direct, indirect, 
index, and relative and various combinations thereof. 


Data Formats F 

Technically, an instruction set manipulates data of various length 
words. Generally speaking, most 16 bit minicomputers can ma- 
: Nipulate data of three different word lengths: 8-bit bytes, 16-bit 
words and 32-bit double words. This data may represent fixed 
point numbers, floating point numbers, or logical data. The datais 
‘used as operands for the instructions, and is manipulated as 
indicated by the particular instruction being executed. 


Typically, fixed point data is treated as signed 15-bit integers in 


the 16-bit representation or as signed 31-bit integers in the 32-bit 
double length notation. Positive and negative numbers are rep- 
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Thus, we will use a few 16-bit and 32-bit formats in this application 
note to demonstrate the function of the program control unit in 
various types of instruction execution. 


instruction Types 


For purposes of this applicalion note, we will define nine different 
instruction types using various addressing modes. As we define 
these instruction types, we will use the basic ADD instruction as 
the example in all cases. It should be recognized that the opera- 
tions of the instructions are similar for all the arithmetic as well as 
logical type operations. However, by using the ADD instruction it 
will be easier to describe the operation of each of these instruc- 
tions rather than to try to be very general in their description. 
Figure 1 shows all nine instruction types with their appropriate 
names. As is seen, four Of the instruction types are single 16-bit 
word instructions while five of the instruction types are double 
word or 32-bit, instructions. The advantage of the double word 
instructions is that a second word can be used as an address 
whereby it provides an mdex value or a second word can be used 
for data which is used as an immediate value. ‘ 


RegIster-to-Register Instructions 


T7"When the register-to-register (AR) instruction is executed, it Is 


- -simply a technique for selecting two of the machine's internal 
working registers in order to execute the desired operation. The 
instruction is fetched from memory and placed in the instruction 
register and the source register R2 and second source register 

- R1 are selected as the two source operands for the ALU. Register 
R1 is the destination register in addition to being a source register 
and the results of the ALU operation will be placed in the register 
specified by the R11 field. In the instruction format shown in Figure 


resented in the ordinary 2's complement notation with the sign bit.-. 4 for the register-to-register instruction, the 8-bil opcode field 


Carrying negative weight. Positive numbers have a sign bit of zero 
and negative numbers have a sign of one. Tne numerical value of 
zero is always represented with all bits LOW. 


Floating point numbers consist of a signed exponent and a signed 
fraction. Many different formats are used by manufacturers in 
expressing floating point data and these variations will not be 
described here. Let it simply suffice to say that the floating point 
number represents a quantity expressed as the product of a 
fraction times the number 2 raised to the power of the exponent. 
In some cases, the number 16 is raised to the power of the 
exponent. Typically, all floating point numbers are assumed to be 
+} normalized prior to their use as operands. No pre-normalization is 
‘performed and all results are post-normalized. Usually, the float- 
ing point instruction set will normalize un-normalized floating 
point numbers. , 


Logical operations are used to manipulate 8-bit bytes, 16-bit 
words or 32-bit double words. All bits participate in the logical 
operations. Ger So 


instruction Formats i ' ! 


Various minicomputers use ¢:fferent types of instruction formats 


tangina from the very simple straight torward formats to the mere” 


complicated dilficult to decode formats. For example, a register to 
register format can consist of a simple 8-bil opcode and two 4-bit 
source operand specifiers. On the other hand, it may consist of a 
byte or word specifier, an opcode specifier, source and destina- 
tion register specifiers, and mode specifiers for each of the source 
and destination register selections. Again, It is not the purpose of 
this application note to describe all of the trade-offs in selecting 
instruction formats but rather fo select a simple-format such that 

the student of bipolar microprogrammed microprocessors can 
understar.d the techniques used by instructions for operating the 
machine. 


specifies the machine operation to be performed. The next 4-bit 
field, 1, in the instruction format specifies the address of the firs 
operand. In most machines, the R1 field is normally the el 
~ of a general register. The 4-bit R2 field in the register-to-registe 
instruction format specifies the address of the second operand; 
this also is normally the address of a general register. In most 
machines, the R1 field also in addition to being a source operand 
is the destination general register select. Thus, the results of the 
operation are stored in the register selected by the R11 field. 


The RR instructions are used for operations between registers. 
_ We are assuming in this discussion that the machine contains 16 
general registers which function as accumulators or index regis- 
ters in all arithmetic and logical operations. Each general register 
contains a 16-bit word consisting of two 8-bit bytes. For arithmetic 
operations, the most significant bit is considered the sign bit using 
2's complement representation. The general registers of the 
machine are usually numbered from 0 to 15 (decimal) and wntien 
in hexadecimai notation as 0 through F. In this example. tne 
general registers have not been given specific functional assign- 
ments. However. in some machines certain registers are as- 
sumed to perform specific functions. These can include spec: 


_.., Stack pointer registers and program counter registers. Figure 2 


: depicts the typical signal path for executing the AR instruction tn 2 
bit-slice system. 


The actual operation of the Recister-to-Register Instruction ts as 
follows. First, the instruction is fetched and placed in the instruc 
tion register as shown in Figure 2. This ts part of the fetch routine 
Next, the instruct.on is decoded via the mapping PROM and the 
appropriate microinstruction in the microprogram merici) 
selected and placed in the pipeline reaiste:. Then, the instruc’ 16 
Is executed where the two registers in the genaral purpose fc 
lers of the Am2393 are seleciac by the contents ofthe Rt ano 2 


" helds of the instruction register The actual microcode requried ‘> 
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Register-to-Register = 
778 wWh12 15 


ADD INSTRUCTION 


: 


(R1) (RI) + (R2) 


Register-to-Memory Reference 9 f9515 4 20 se ito 
PA 15 * 893.q whsnnO Ae ‘ 
| x2_| (RA) (AI) + [(K2)P Seep e ee 
ig t) otra ’ 1 Siq ba : 
‘ 15 ( & 50 
[ x2 | © [Oe] © [ocn] + [0x2] 
Register Short Immediate che ato 
0 15]- UloWsco MA po ator 2 I 
| Rt | DATA | °° (RN) & (RT) + DATA! Oo 5 295 
Register-to-Indexed Memory OW } 1 a r bulce 
0 31) yal cAousmaer sONSv b , 
(Rt) + (Rt) + [(x2) + A] 
F | ts 2ote!uc 
j i's Bee te Siti Nea es 
ate (R1) + (RI). + DATA.+ [(%2)]}. 
(RI) = (Rt) + DATA _ 
i] YY, 0 + 
‘ gif eh 2 88T ar0 lor avs 
© [xn] — [en] + DATA 
Note: (R1) means the contents of register 1. ; , 
[(X1)] means the contents of the word whose address Is in R1. | 
Figure 1. Varlous Instruction Types for the ADD operation. 
own 
1% 
; S4 
wr | 


Figure 2. Reglster-to-Register Instructions Select Two Registers In the Am2903 Array for Instruction Execution. 


execute this instruction is shown in Figure 3. Here, we assume _ 


the Program Counter (PC) value Is contained in one of the gen- ~ 


ral registers and can be selected by microcode as well as} a Ri1 
and R2 fields. This was shown in Chapter 3. Le 
Register-to-Memory-Reference j H 

The register-to-memory-reference instruction is one whereby the 
contents of the memory location pointed to by the register iden- 
tified with the X2 value is fetched from memory and then added to 
the register value specified in the R1 field. The result of this 
operation ts piaced in the register specified by the Ri field. 
Figure 4 shows a general block diagram of the hardware used to - 
implement the instruction types described in the first part of this 
application note. As shown, the memory address register can be 
driven by either the Y outputs or the DB outputs of the Am2903s. 


Microinstruction 
Operation 
PC -> MAR; PC + 1+ PC 
Fetch Inst to IR 
Decode ; 
R1+R2 +R 


“ “MICROPROGRAM 
~MEMOR’ 


PIPEUNE 
REGISTER 


' 


} Caaf 
———— 


ES 


OATA 


MEMORY 


(MEM) 


‘In addition, the Y outputs of the Am2903s can be placed onto the 


memory data bus by means of a three-state buffer. The computer 
control unit is intended to be representative of that described in 
Chapter 2 of this application note series. For purposes of this 


discussion, we assume the program counter (PC) is one of the’ 


general purpose registers within the Am2903 register stack. 
Later, we will change this concept and use the PC extemal to 
Amz2903, ' . 


The operation of the register-to-memory-reference instruction as 
depicted in Figure 1 can best be described by reterring to Figure 
5. Here, we see the first three microinstructions that represent the 
fetch routine for the currently described machine. First, the pro- 
gram counter is placed in the memory address register and the 
program counter is incremented and retumed to the PC register. 


Microcycle Time 


AB GENERAL 
PURPOSE 


4 = 
por] oor] 
va 
eee eee 
—— 


MEMORY 
ADORESS 
REGISTERS 


Figure 4. Simple Memory Addressing Scheme with PC in the ALU. 


193 


OP wre 


ma Kw we 


” thet 
ta 
te 

iad 

ey 


194 


, * 
Microinstruction ~. 
Operation 


PC MAR; PC + 1 PC 
Fetch Inst to IR 

Decode 

(X2) -> MAR 

MEM + R1 > R2 


Figure 5. Register to Memory Reference instruction Microcode. 


Next, the Instruction is fetched from memory and placed in the 
instruction register within the CCU. Thirdly, the instruction is 


decoded via the mapping PROM and the appropriate micro- ~~ 


instruction selected and placed in the pipeline register. To exe- 
cute this particular register-to-memory-reference instruction, itis 
necessary to place the contents of the register specified by the X2 
field into the memory address register. Then the contents of 
memory can be fetched and the operand added to the value 
currently contained in the register specified by the Ail field. The 
result of this operation is placed In the register specified by the R1 
field. All totaled, the execution of this register to memory refer- 
ence instruction requires nee microcycles as depicted in this 
example. 


Memory to Memory 


This instruction is one whereby the memory location pointed to by 
the contents of the register specified in the X2 field is fetched and 
the memory location pointed to by the contents of the register 
locations specified in the X1 is fetched and these two operands 
are added together. Al the completion of the Instruction, the 
resultant is placed in the memory location as defined by the 
contents of the register specified in the X1 field. 


The Memory to Memory Instruction operation Is also depicted by 
the block diagram shown in Figure 4. In fact, all of the next six 
instructions to be defined utilize the block diagram of Figure 4 to 
represent the hardware required for implementing these instruc- 
tons. 

The microcode required for the memory fo memory instruction is 
detailed in Figure 6. The first three microinstructions represent 
the fetch routine. In the fourth microinstruction, the contents of the 
register specified by the X2 field are placed in the memory ad- 
dress register. Then, in the fifth microinstruction the contents of 


Microinstruction \_ 
Operation 


PC - MAR; PC + 1 PC 
Fetch Inst to IR 

Decode 

(x2) + MAR 

MEM +Q 

(X1) + MAR 

MEM + Q +Q 

Q —+MEM 


this memory location ts loaded into the Q register within the 
Am2903. This value is temporarily held for use later. In the sixth 
microinstruction, the contents of the register specified by the X1 
field in the instruction is placed in the memory address register. 
On the seventh microinstruction, this operand is fetched from 
memory and added to the contents of the Q register with the result 
being placed in the O register. In the eighth microinstruction, the 
current contents of the Q register is returned to the memory 
location. This memory location Is specified by the contents of the 
register specified by the X1 field and is still inthe memory address 
register. Thus, we have used the O register as a temporary 
holding register for the data used in this instruction. 


Register with Short-immediate 


This instruction Is a technique whereby a 4-bit field is added to the 
contents of the register specified by the R1 field. Thus, short 
jumps or branches can be executed within a range of zero to 
filteen memory locations. The more significant 12-bits of the word 
are zero filled. 


The register with short iminediate Instruction operates very simi 
lar to the register-to-register instruction. The microcode for this 
instruction is shown in Figure 7. The only difference between the 
fegister-to-register instruction and the register short-immediate ~ 
instruction is that instead of adding operands specified by the R1 
and R2 fields, we take a data value contained In a four-bit field in 
the instruction as depicted in Figure 1 and add it to the contents of 
the register specified in the Al field. The results of the operation 
are retumed to the register specified by the R1 field. This addition 
ts performed by taking the 4-bit data value shown In Figure 1 as 
the DATA and zero filling the twelve most significant bits. This 
gives us a 16-bit word ranging in value between zero and fifteen. 
Thus, short jumps can be implemented using this technique. 


Microcycie Time 


Figure 6. Memory to Memory Instruction Microcode. 


Microinstruction 
Operation 


PC - MAR; PC + 1 PC 
Fetch Inst to IR 

Decode 

R1 + Data ~R1 


Figure 7. Register Short Immediate Instruction Microcode. 


Register to Indexed Memory 


The 16-bit word in the register defined by X2 in the instruction Is 
added to the address that is the second word of memory. Then, 
this address is used to fetch an operand from memory which is 
added to the contents of the register pointed to by R1. The results 
of this operation are then placed in R1. The instruction format for 
this instruction was shown in Figure 1. 

The Register to Indexed Memory Instruction is shown is Figure 8 
and executed in the following manner. First, the current PC value 
ts placed in the MAR and PC + 1 is retumed to the PC register. 
Next, the instruction at this memory location is fetched and placed 
in the instruction register. On the third cycle this Instruction is 
decoded and the contents of the microprogram memory placed in 
the pipeline register. On the fourth microinstruction, the PC value 
is again placed in the MAR and PC + 1 Is returned to the PC 
register. On the fitth microinstruction, the value at this location in 
memory is fetched and added to the contents of the X2 register 


Microinstruction 
Operation 
PC.-> MAR; PC + 1 PC 
Fetch Inst to IR 
' Decode } 
PC - MAR; PC + 1 PC 
MEM + X2— MAR 
— MEM + R1>R1 


Microlnstruction 
Operation 


PC -> MAR; PC + 1 -*PC 
Fetch Inst to IR 

Decode 

(X2) - MAR 

MEM + Ri Ri 

PC > MAR; PC + 1 > PC 
MEM + R1 > Rt 


with the result being placed in the MAR. And on the sixth mic- 
roinstruction, the operand pointed to by this adcress is fetched 
and added to the contents of R1 with the result being placed in the 
register pointed to by the Ai field of the instruction. 


Register to Memory Immediate 


In the register to memory immediate Instruction, the contents of 
the memory location pointed to by the register specified in the X2 
field is fetched from the memory and the data value which fs in the 
second word of the instruction is also fetched from memory and 
added to it. This result is then added to the contents of the Rt 
tegister and the final result replaces the value currently in Ri1. 


The register to msmory immediate instruction as shown in Figure 
1 ts implemented using the microcode shown in Figure 9. Again, 
the first three microinstructions are the fetch routire. The fourth 
microinstruction is used to take the contents of the register 
specified by the X2 field and place it in the memory address 


Microcycle Time 


Figure 9. Register to Memory Immediate Instruction Microcode. 
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register. Next, the operand at this memory location is brought into 
the Am2903's and added to the contents of the register specified 
by the Ri field with the results retumed to that register. The sixth 
microinstruction is used to set up the memory address register to 
fetch the second word of the instruction. Tha seventh micro- 
instruction brings this data value into the Am2903 ALU via the 
data bus and adds this value’ to the contents of the register 
Specified by the Ai1 field. The result of the operation is placed into 
the register specified by the Rit field. . 


Momory to Memory Indexed ~—— see 


The memory to memory indexed instruction is one whereby the 
contents of the register specified in the X2 field are added to the 
second word of the instruction to form a new address. This 
address is then used to fetch an operand which is added to the 
operand selected by taking the contents of the register specified 
in the Ri1 field and using that as a memory address to fetch an 
operand. The result of this addition is then replaced in the mem- 
Ory location pointed to by the contents of the register specified in 
the X1 field. 


The memory to rhemory indexed instruction Is probably the most 
comp!cated of the instruction formats described in the application 
note. In all, nine microinstructions are required for its implemen- 
tation. Basically, the first three microinstructions are used to fetch 
the instruction from memory, place it in the instruction register, 
and decode the instruction for initial operation. Again, the basic 
fetch routine. Microinstruction number 4 sets up the memory 
address register to fetch the second word of the instruction and 
microinstruction number 5 is used to bring this value from mem- 


Microinstruction 
Operation 
PC ~» MAR; PC + 1+ PC 
Fetch Inst to IR 
Decode 
PC MAR; PC +1-PC _ 
MEM + X2 - MAR 
MEM ~Q 
(X1) + MAR 
MEM +Q-~Q 
Q - MEM 


Ory into the Am2903 ALU where it is added to the X2 register. The. 
results of the addition are placed into the memory address regis- 
ter during this microinstruction. This value is used to fetch a value 
from memory which is placed In the O register using micro- 
instruction number 6. In the seventh microinstruction, the con- 
tents of the register pointed to by the X1 field ara placed in the 
memory address register so that microinstruction eight can be 
utilized to bring this memory value into the Am2903s where it is 
added to the contents of the O register with the result being 
placed into the Q register. Microinstruction number 9 is used to 
place this value back into the rhemory location as specified by the 
centents of the register pointed to by the X1 fidid. This memory 
address ts still contained in the memory address register so that 
No updating is required. The total microcode required to imple- 
ment this instruction routine is shown in Figure 10. 


Register Immediate 


The register immediate Instruction is a very useful instruction 
which allows data to be added to the contents of the register. in 
this example, the second word of the instruction is fetched and 
added to the contents of the register specified in the R11 field. 


Figure 11 depicts the microcode used to implement the register 
immediate instruction. Here, the first three microinstructions are 
the fetch routine for the instruction. The fourth microinstruction of 
this routine sets up the MAR to fetch the second word of the two 
word instruction. The contents of this memory location is brought 
into the Am2903 ALU and added to the contents of the register 
specified by the R11 field. The result of this pracalion is placed in 
the register specified by the R1 field. 


Microcycle Time 


Figure 10. Memory to Memory Indexed Instruction Microcode. 


Microlnstruction ae 
Operation | TO} T1 
PC -» MAR; PC + 1 PC 
Fetch Inst to IR 


Decode 
PC + MAR; PC + 17 PC 
MEM + R1— Rt 


Microcycie Time 


Figure 11, Register Immediate Instruction Microcode. 


Memory immediate - 2 Morel 


The memory Immediate instruction Is used to add immediatgdata 
contained In the second word of tho instruction to a location in 
memory. The memory location is contained in the register 
specified in the X1 field of the Instruction. 


The memory immediate instruction is similar to the register im- 
Mediate instruction except that an indirect addressing scheme is 
used. Again, the first three microinstructions fetch and decode the 
memory immediate instruction. The fourth and fifth microinstruc- 
tions are used to fetch the data value which is the second word of 
this memory immediate instruction. Microinstruction number 4 
sets up the memory address register and microinstruction 
number 5 brings the data into the Am2903 Q register. Micro- 
instruction number 6 places the contents of the register specified 
by the X1 field into the memory address register so that the 
contents of this memory location can be brought into the Am2903 
during microinstruction number 7. Here, during microinstruction 7 
the contents of the Q register are added to this value and returned 
to the Q register. At microinstruction 8. the Q register is written 
back to the memory location as specified by the contents of the 
fegister pointed to by the X1 field. This value was already in the 
memory address register because it was used to fetch the 
operand originally at this location. The microcode for this instruc- 
ion is detailed in Figure 12. meen 


Microinstruction A! : 
Operation 


PC — MAR; PC + 17% PC 
Fetch Inst to IR 


Decode ; 
PC - MAR; PC + 1 + PC 
MEM ~Q 

(X1) + MAR 

MEM +Q-~Q 

Q-~MEM 


improving Program Controi Unit Performance 


if we examine the microcode as shown for the various instruction 
types depicted in Figure 1, we find that all of these microroutinas 
have several things in common. First, the very first microinstruc- 
tion simply sets up the memory address register with tra current 
value of the program counter in addition, this microinstruction 
increments the current program counter value. The second mi- 
croinstruction simply fetches the contents of memory and places 
it in the instruction register. The third microinstruction is used to 
decode the microinstruction, select the appropriate micromemory 
word and set it into the pipeline register. Fina'ly, the fourth micro- 
instcuction begins actual execution of the desired instruction. in 
all of these examples and using the block diagram of Figura 4, we 
find that a bottle neck occurs in the ALU Lecause of ourneed tobe 
operauing on program counter data and operand data intermi«ed. 
We can improve the performance cf the program control unit by 
making the program counter an external register and using a 
multiplexer to select either the program counter or the Am2903 
output to load the memory address register. This is depicted tn 
block diagram form in Figure 13. 

The first effect of implementing a program contro! unit with this 
architecture is that one of the instruction types is shortened by 
one microcycie. This is the register-to-memory-immediate in- 
struction. The new microcode flowcharts for thts instruction ss 


Microcycle Time 


Figure 13, Memory Addressing Scheme with PC Outside of the ALU. 
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shown in Figure 14. In this case, we see that a PC value can be 
placed into the memory address register and the PC incremented 
while the ALU within the Am2903 is being used to perform either a 
pass or an addition. Thus, this architectural change has made 
some improvement in the thru-put of our machine. 


The most important improvement in thru-put realized by the ar- 
chitecture shown in Figure 13 can be seen by evaluating the 
timing for sequential instrucbons. That is, what happens when 
several instructions are executed sequentially? 


Tokeep the examples simple, let's visualize the microcycie timing 
chart for three register-to-register instructions executed sequen- 
tially. The most obvious timing chart would simply be to take the 
register-to-register microinstruction flows as shown In Figure 3 
and concatenate three examples of this timing chart. If we do this, 
we will see that the final execution of the values of A1 + R2retum 
to R'1 utilize the ALU, but the program counter is not in operation. 
However, the next microcycle requires placing the program 
counter into the memory address register. Thus, the architecture 
of Figure 13 allows us to do these two micro-operatons during the 
same microinstruction. If we assume three register-to-register 
instructions in sequence In memory; let's call them instruction A, 
B and C; the timing chart of Figure 15 results. What we see in this 
diagram is that the execution of instruction A can be overlapped 
with the set up the program counter in memory address register 
for tatching instruction B. Thus, instead of instruction B starting at 
time T4, it may be started at time T3. This can be accomplished by 
simply having the execution microinstruction also load the MAR 
with the current PC value and increment the PC. From this dis- 
cussion, we can see that instead of twelve microcycle times being 
required to execute three register-to-register instructions, only 
nine microcycie times will be required. We should caution that if 
the reader counts the microcycies in Figure 15, he will arrive at 10 
microcycie times being required. This leads us to our next point. 


If we examine all of the instructions described earlier in this 
appiicaton note, we will find that in all cases, the execution of the 
instruction (the last microcycie) can be overlapped with the first 


Microinstruction ¢ 
Operation 
PC -- MAR; PC + 1+ PC 
Fetch Inst to IR 

Decode 

(X?) + MAR 

MEM + R1 Rt 

PC — MAR; PC + 1 Pu 
MEM + R11 R1 


Operation 


PC MAR; PC + 1 +PC 
Fetch Inst to IR 

Decode 

R1+ R2—-R1 


microinstruction of the fetch routine. Thus, the architectural 
change shown In Figure 13 not only allows three of the instruc- 
tions to execute faster during their total microcode, but in fact all 
microinstructions can be executed at least one microcycle faster 
because of the ability to overlap the first microcycle of the fetch 
foutine with the execution of the instruction. This architectural 
change therefore saves one or two microcycles depending on the 
instruction. ' 


In Chapter 9 we will show how further overlapping at the machine 
instruction level can allow us to execute a register-to-register 
instruction during every microcycle, effectively;, rather than every 
three microcycies as shown in Figure 15. At the present time, let 
us Simply leave the discussion at this point. 
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Subroutining 


An implementation technique thal is common to the different 
addressing modes Is the subroutine (also called stack and link). 
The subroutine allows sections of main program to access a 
common subsection of the program. The general effect is to allow. 
less lines of machine code to be written for any given prosiany that 
employs subroutines. 


Figure 16 shows an example of a subroutine within the program. 
The main program executes instructions until it gets to instruction 
52 whichis acall to subroutine. This instruction puts address BO in 
the program counter while saving address 53 in a separate reg- 
ister called Return Register. The program continues on from 
address 80 to address 85 where it encounters the retum from 
subroutine command. The return-from-subroutine command 
takes a value out of the return register and puts that into the 
Program cuunter. At that point the program counter continues 
down in the main body of the program until it reaches address 57. 
At this time, another call to subroutine may occur forcing the 
program counter back to the value of 80 white putting the value 58 
into the retum ado-ess. The subroutine is executed and at ad- 
dress 85 the return command Is again encountered.-At this point, 


Microcycle Time 


Figure 15. Register to Hegisier instruction with Overiap of Execute and PC Control. 


SUBROUTINE 


Figure 16. Subroutine Execution. 


the subroutine will return control of the program to address 58 of 
the instruction stream and the main program continues to se- 
quence through its instructions. 


In many systerns, one subroutine may very well call another 
subroutine which may in turn call yet another subroutine and so 
on. To accomplish this the return address linkage must now be 
“nested” using a last-in first-out (LIFO) stacking arrangement. 
Figure 17 ilustrates subroutine nesting. In this example, the main 
program contains a subroutine cali or jump-to-subroutine com- 
mand (JSB) at address 53. Prooram control Is passed to the first 
subroutine at address 88, while the return address 54 is placed in 
the stack. At address 89 the of the subroutine 1 another JSB 
command ts encountered passing the program control to Sub- 


routine 2 at address 502. The return address value 90 is pushed 


onto the top of the stack. Tnis continues in like fashion for calls to 
Subroutine 3 and 4 with return address 506 and 723 being placed 


on the stack. At address 785 of Subroutine 4, a Return from ~ 


Subroutine (RTS) command is decoded causing the return ad- 
dress 723 on the top of the stack to be placed in the program 
counter and the contents of the stack are “poped” up one piace. 


MAIN | 
PROGRAM 


SUBROUTINE 1 


SUBROUTINE 2 


At address 725 another RTS command is found, causing the top 
of the stack, address 506, to be placed in the program counter 
and the stack is poped. The identical action occurs for the RTS 
commands at address 507 and 92 such that control is eventually 
returned to the main program and the stack is empty. 


The LIFO or subroutine stack in the program control hardware is 
shown in Figure 18. When the call from subroutine command is 
decoded by the computer control unit, the pipeline register out- 
puts cause the stack control to accept the output of the program 
counter register and place it at the top of the stack. Next the 
subroutine address is brought in from the memory passed 
through the multiplexer and placed in the MAR. The subroutine 
address ts also brought through the multiplexer incrementer, 


. through the incrementer and placed in the program counter reg- 


ister to be used as a possible next source of address. The sub- 
routine return address is recovered from the stack when the 
pipeline register instructs the stack control logic to place the 
return address at the multiplexer. The retum address is passed 
through the multiplexer and clocked into the MAR. The return 
address is also clocked into the PC register via the incrementer 
multiplexer and the incrementer, for use as the next sequential 
address. Figure 19 shows the jump to subrcutine instruction and 
Figure 20 shows the microcycles that are used in a typical call to 
subroutine command using the program control hardware shown 
in Figure 18. At TO the program counter is placed into the MAR 
and updated. Time T1 finds the MAR accessing the subroutine 
call instruction, with the instruction being placed into the instruc- 
tion register. At T2 the opcode is decoded by the CCU, and the 
first instruction microcode bits are clocked into the pipeline reg- 
ister. At time T3, the PC is placed in the MAR. At T4 the starting 
address of the subroutine is being fetched and placed inio the 
MAR; the stack pointer is incremented: the current program 
counter is placed on the LIFO stack; and the starting address o! 
the Subroutine plus one is placed into the program counter. 


Figure 21 details the microcycle timing for a retum-from-sub- 
routine execution. At time zero the current program counter Is 
placed into the MAR, then incremented by one. During time one 
the contents of tne MAR fetches the return from subroutine com- 
mand, which is then clocked into the instruction register at the end 
of the microcycle. At time 2 the contents of the instruction register 
is decoded in the CCU with the contol bits being clocked into the 
pipeline register. During time 3 the retum address on the top of 
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Figure 17. Nested Subrouline Sxumple - 
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Figure 18. Subroutine Stack Architecture. 


Microinstruction 
Operation 
PC -> MAR; PC + 1-+PC 
Fetch Inst to IR 
Decode 
PC + MAR; PC + 1 +>PC 


MEM -* MAR; PC + STACK 
MEM + 1 PC; SP + 1 SP 


Figure 20. Branch and Stack Instruction Microcode. 


Microinstruction Microcycle Time 
Operation 
PC -> MAR; PC + 1 ~PC 
Fetch Inst to IR 
Decode 


Stack + MAR; Stack + 1 +SP 
sp -1-SP 


Figure 21. Return from Subroutine Instruction Microcode. 


the LIFO stack Is placed into the MAR, while that value plus one is 
Stored into program counter. The stack pointer is then 
decremented. 


The basic program control hardware thus developed with some _ 


embellishments added are contained within the Am2930 program 
control unit as shown in Figure 22. The Am2930 is a 4-bit slice of 
the program control unit. It therefore easily allows the address 
bus to be virtually independent of the data bus In terms of width. 
The Am2930 has a general purpose auxiliary register which has 
two sources and two destinations. One source being the D inputs 
which flow through the R multiplexer and hence into the auxiliary 
register and the other source being the output of the full adder 
which is the second input to the R multiplexer. The two outputs of 
the auxiliary register go to the A and B multiplexers which in turn 
source the A and B inputs to the (ull adder. The register enable pin 
(RE) allows the auxihary register to be unconditionally loaded 
from the D Inputs of the Am2930. The A multiplexer selects as its 
Sources a logical zero, the output of the auxiliary register, or the D 
inputs. The B multiplexer accepts the outputs of the auxiliary 
fegister, a logical zero, the output of the subroutine stack file, or 
the output of the program counter register as its sources. 


In the Am2930 design the LIFO stack is 17 words deep, allowing 
up to seventeen levels of subroutine. The LIFO stack is controlled 
by the stack pointer logic which gives a FULL indication when the 


stack is full and an EMPTY indication when the stack has 
emptied. The input to the LIFO stack is fed through a stack 
multiplexer whose inputs may be D inputs or the output of the 
program counter. Thus. depending upon the application, the 
stack may be used as either a subroutine stack or a general 
purpose LIFO stack which resides on the D bus. The incrementer 
and the full adder are coniro!led by the Ci and Cn carry-in bits 
respectively. Figure 23 details the ripp!e carry connections be- 
tween Am2930s in a 16-bit array. The Ci input of the least signifi- 
cant slice (LSS) is controtied from the pipeline register. 


The Ci signal ts intemally propagated through the incrementer of 
each device using carry look ahead logic. The microprogram 
memory, using the Ci input may now cause the Am2930s to 
repeatedly access the same main memory instruction if so de- 
sired. The full adder has its Cn input tied to ground for the LSS 


~ device of the Am2930 array. The Cn signal! is progagated in 


parallel through the Am2920s. 


For a faster propagation of the Cn signal the interconnection 
shown in Figure 24 should be employed. The generate and 
propagate pins (G, P) of the Am2902A carry look ahead 
generator. The look ahead carries (Cn + x, y, z) are connected to 
the Cn inputs of their respective devices. The output of the 
Am2930 is three-state and is controlled by the output enable pin 
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Figure 22. Am2930 Block Diagram. 
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Figure 23. Ripple Expansion Scheme for Am2930's. 


Figure 24. Paraliel Look-Ahead Expansion Scheme for Am2930's. 


(OE). Other features of the Am2930 include an Instruction Enable 
pin (IEN). This pin allows the Am2930 array to be taken off of the 
microprogram data bus thus allowing the bits that were formerly 
committed to the Am2930 to be used in conjunction with other 
devices. The Am2930 also includes a condition code input (CC). 
The Condition Code input permits the conditional testing of a 
single bit. This allows the feasibility of such techniques as condi- 
tional branching at the macroprogram level. For more detailed 
explanation of the Am2930, its instructions and its applications, 
see the Am2930 Data Sheet. Figure 25 shows a typical system 
interconnection using the Am2930. The instruction lines, Ci, RE 
and the OE control pins are connected directly to the outputs of 
the combination micropregram memory and pipeline registers 
coniained in the Am24775 devices. The condition code inputs are 
obtained from the Am2S04 stetus and control device, thus allow- 
ing conditional jumps on status. Status from the Am2904 is also 


fed into the test mux for use by the Am2910 for its conditional 
code input. Likewise the full and empty Indications from the 
Am2930 are fed into the test MUX for use by the Am2910 to 
ascertain the current status of the stack. If the stack is full and the 
user wishes to push the data onto the stack then the current data 
must be emptied from the stack under microprogram control, 
using additional hardware. 


Another feature of the Am2930 Program Control Unit as shown in 
Figure 22 Is the full adder between the program counter and Y 
outputs. This allows for the execution of PC relative addressing 
types of instructions. While this can be an effective addressing 
scheme, it will not be covered in detail in this application note. 


While the Am2930 offers advantages in small high performance 
systems requiring a small LIFO stack, it is not intended to be the 
solution for alt program counter requirements. 


Figure 25. System Interconnection Using the Am2930. 


Using the Am2901A as 2 Program Controt Unit 


Up to this point, the discussion has concemed a general ar- 
chitecture which includes 16 general registers in the ALU section 
and the LIFO stack is a program control section as shown in 
Figure 18. An aitemative architecture and that used by most 
general purpose machines, is to place the LIFO stack in main 
memory. The stack pointer for the main memory LIFO stack can 
be contained in the program control unit to be described in this 
section. If the program control unit is built using Am2901A's it now 
has the capability of using its internal registers as the program 
counter, stack pointer, upper stack bound pointer, lower stack 
bound pointer, and intemal temporary registers. This of course 
provides considerable flexibility in the architecture and also al- 
lows for a much greater repertoire of instructions to be executed. 
Particularly, several stack instructions can be included in the 
instruction set, most of which will use the form of the register-to- 
indexed-memory instruction format as shown in Figure 1. 


Another advantage of the architecture shown in Figure 25 is 
Speed. The Am2901A's slightly surpass the Am2903 in speed. 


MPR-57) 


Thus, a 16-bit Am2901A program control unit architecture can be 
implemenied and it will perform well within the microcycle umes 
budgeted for the system. 


Looking at Figure 26 which shows the Am2901A used as a 
Program control unit and the Am2903 used for the general regis- 
ter stacks/ALU section, we see a three-state buffer on the Y 
outputs of the Am2903 connected to the data bus as well as a 
three-state buffer at tne input of the Am2903's from the data bus. 
This provides isolation and buffering for the bus as well as allow- 
ing appropriate disconnects so that certain microcycies can be 
combined to improve the overall performance of the machine. In 
addition a transfer register is used between the Am2303's and 
Am2901s to allow a microcycie to be terminated if an ALU opera- 
tion is taking place within the Am2903's. This provides higher 
performance operation for the machine. In addition, a bi-direc- 
tional butfar (such as the Am6304B) is used between the 
Am2901A Y-outpuls and the Am2903 Y-outputs. This gives the 
ability to push the program counter contained in the Am2901A on 
the stack for interrupt handling. In addition, values coming from 
the Am2903 can be placed in the memory address register. 
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Figure 26. PCU Architecture Using the Am2901A. 


Summary 


The thrust of this discussion has been aimed at defining and 
implementing hardware to accomplish addressing of main mem- 
ory. We have shown that a speed advantage is realized if the 
program counter is kept separate from the main genera! purpose 
register stack/ALU hardware. The most general purpose program 
control unit is the Am2901A. It offers several advantages in terms 
of program control, stack pointer control, and stack pointer 
boundary conditions. The Am2930 can be used in program con- 
trol units occupying less space and including a built-in stack, but 


has some speed and performance limitations. Both devices can 
be used to implement the basic addressing modes associated 
with the instructions described in this application note. 


Another purpose of this application note is to set the stage for 
Chapter 9 where we will overlap machine instructions such that 
register to register instructions can be executed in a single 200ns 
microcycle and the memory reference instructions can be exe- 
cuted in 600ns (3 microcycles) as the effective execution time. 
Also, we will expand on the use of the Am2901A as a Program 
Control Unit. . 
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INTRODUCTION 


A digital computer can be viewed as a finite state machine that 
moves trom state to state via the execution of a program. inter- 
tupt mechanisms provide a well-defined way of altering the llow 
of states in response to outside asynchronous events (inter- 
rupts). There is a wide variety of ways of handling interrupts 
depending upon the system requirements. The choice of a par- 
ticular interrupt mechanism can have a large impact on the 
through-put and flexibility of a system. Therefore, time should be 
spent carefully defining the interrupt mechanism of a new com- 
puter design. 


POLLING VS. NON-POLLING 


One of the simplest ways to handle asynchronous events is the 
polling method. With each possible event there is an associated 
tlag that can be accessed by the program. The processor then 
interrogates each flag in order to determine if service is required. 
This method trades simple hardware for software. This not only 
uses memory space but also uses time for polling the flags when 
No service is required. The polling.method has low system 


through-put, high real time overhead and slow response time. 


In non-polling systems, the asynchronous event generates an 
interrupt request signal wnich is passed to the processor. The 
Processor in turn suspends the execution of the current process 
and starts execution of an interrupt service routine. When the 
interrupt routine is.completed, the processor resumes execution 
of the suspended process. This system is called an interrupt 
driven system because it executes interrupt service routines that 
are inivated by interrupt requests. 


Athough the non-polling method requires more hardware, it has 
Many advantages. Because the execution of interrupt service 
routines is transparent to the current process, less thought and 
time is required of the programmer of the current process. The 
response time is faster because no time is spent interrogating the 
other non-active interrupts, which in turn increases the system 
throughput. There is less real time overhead and less memory 
Space required because only the service routine exisis in memory 
and no polling routine is required. 


MACHINE VS. MICROPROGRAM LEVEL INTERRUPTS 


There are two levels on which interrupts may be handled. The 
first and most common is the machine level interrupt. In this 
method possible interrupt requests are checked for during the 
machine instruction fetch cycle. This guarantees that an inter- 
tupt can only happen when a machine instruction is complete 
and before a new instruction starts. - Lane ace 


The second level of handling interrupts is on the microprogram 
level. In the machine tevel interrupt system, the microprogram 
has complete control of when to recognize an interrupt but in the 
micrcprogram level system the microprogram can be interrupted 
.at any time. This method has a smaller response time for ser- 
vicing interrupt requests but requires that restrictions may be 
placed on the microprogram and the interrupt mechanism. 
These restrictions come from setting aside space on the finite 
microprogram stack in the sequencer for possible Interrupt re- 
quests. Special co sideration may also have to be given to loop 
counters. scene - 


TYPES OF INTERRUPTS 


There are basically four types of interrupts based on the re- 
lationship of the source of the interrupt to the processor: within 
the processor, within the system, between software, and be- 
tween processors. A multiprocessor has to be able to handle all 
four levels of interrupts. Therefore, the interrupt structure that is 
picked will have these design tradeotfs to consider. 


A. Intraprocessor interrupts are those asynchronous events 
that happen within the processor during the execution of a 
machine instruction. This group includes such things as zero 
divide, overflow, accessing restricted memory, execution of 
a privileged instruction, machine failure, etc. 


B. Intrasystem interrupts are interrupts created by system 
penpherals such as disks, CRT's and printers that require 
service. 


C. Executive interrupts are those interrupts caused by the cur- 
rent program that is executing. This provides a way for the 
current program to make a request of the executive (operat- 
ing system) program. These requests might include such 
things as starting new tasks, allocating hardware resources 
(disks, line printers), communication with other tasks, etc. A 
good example would be the supervisor cali (SVC) in the IBM 
360/370 computers. 


D. Interprocessor interrupts include those interrupts between 
two intelligent processors. For example, this class of inter- 
tupts would be used to initiate data and status transfer be- 
tween a local processor and a processor at a remote site. 


SEQUENCE OF EVENTS FOR INTERRUPT HANDLING 


When an interrupt occurs there is a sequence of six events that 
happen. These events, which can be implemented in microcode 
or machine code, integrated together with the hardware com- 
prise the interrupt mechanism. The sequence of events de- 
scribes the steps that occur to provide for a smooth transter fram 
the current process environment to an interrupt servicing envi- 
ronment and back again. The sequence ensures that the proces- 
sor status will be the same immediately after an interrupt is 
serviced as immediately before the interrupt occurred. The 
events listed in the next few paragraphs may differ in order or 
overlap depending upon the machine design and application. 


Interrupt Recognition 


This step consists of the recognition of an interrupt request by 
the processor via an interrupt request line. In this step the pro- 
cessor can determine which device made the request. The 
method that is used to determine which device to service is 
directly related to the interrupt structure of the machine. The 
different types of interrupt structures will be discussed in more 
detail below. 


Save Status 


_The goal of this step is to make the interrupt sequence trans- 


parent to the interrupted process. Therefore, the processor saves 


a minimum set of flags and registers that may be changed by the’ 


interrupt service routine, so that alter the service routine is 
finished they may be restored. = 


The minimum set of flags and registers would be those which 
will be destroyed in the transfer of control from the current pro- 
cess to the interrupt service routine. It is then the responsibility 
of the service routine to save any other registers which it might 
change. The minimum set of flags and registers might include 
the Program Counter, Overflow Fiag, Sign Flag, Interrupt Mask, 
etc. The minimum set also includes any register or tlag that 
needs to be saved that the interrupt service routine cannot 
access. 


Interrupt Masking 


This step can overlap some of the other steps. For the first few 
steps ol the sequence all interrupts are masked out so that no 
interrupt may occur before the processor status is saved. The 
mask is then usually set to accept interrupts of higher prionty. 
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Some machines allow the service routine to celectively enabla 
er disable interrupts also. There may be different variations to 
this step depending upon the appt:cation, 


interrupt Acknowledge 


At some point tha processor must acknowledge the interrupt 
being seniwed $0 inat the interrupting device Ancws thal itis trae 
to continue tts task. The processor can acknowizdge several 
ditierent ways. One of the ways is to have a ne cevoted ta 
interrupt acknowledge. Anolher method relies upon the inter- 
Tudiing davice recogniz'ng an acknewiedge when the cause of 
tha interrupt is serviced. 


Some processor designs also use this signal as a request for the 
inierrupting dcvice to send an 1.0. down the data bus. This as- 
peci will be discussed in more detail below. 


interrupt Service Routine 


At this po.nt the processor can call the irtezrupt service routine. 
The address of the routine can be obtaincd several ways de- 
pend:rg ucon the system architecture. The most trivial is when 
tere ts only one routme which polls each device to find out 
which one mterrupied Some designs require that the interrupt- 
ing device pul an adtiress on tne data bus so that the processor 
can Store it in &$ Eregram counter and branch to it, Olher de- 
signs use an ID. number derived from the priority of the interrupt 
and put it through a mappicrg PROM or look-up table in memory 
in order to odtuin the aovress of the senice routine. 


Restore and Rerum ‘ aoe: 


Aher the interrupt service routine nas retumed via some vania- 
tion of an Interrupt Return instruction, the processor should re- 


Figure 1. Single Request, Multiple Poll. 


store all the registers and flags that were saved provious to ine 
interrupt routine. tf this is done correctly, the processor should 
have the same status as before the interrupt was recognized. 


INTERRUPT STRUCTURES “9 


There arc several interrupt structures that can be implemented, 
As usual there Is a trade-off betwaen hardware and software (or 
firmware). Listed below are some ol the more common struc- 
tures used. The parucutar structures vary in the way that the 
processor determinas which device made the interrupt gequest. 


Single Request, Multiple Poll 


In this structure there is one request line which is shared among 
all interrupting devices. When the processor recognizes an inter- 
rupt request It polls all the devices to find tha interrupting device 
(see Figure 1) Pnonty is introduced via the order in which the 
devices are potied. This scheme also allows dynamic real‘oca- 
tion of p-ionty. 


Single Request, Daisy Chain Acknowledge 


In this structure there is one request line which is shared. When 
the processor receives an interrupt it sends out a signal 
acknowledging tne interrupt. The acknowledge signal is passed 
from VO device to VO device until the interrupting device re- 
cetves the signal. At this point the interrupting Cevice identhes 
itself by puiting an 1.0. number on the data bus (see Figure 2). 
This structure requires less software, but bas a stabc pnonty 
associated wilh each interrupting device. There is also a tme 
deiay associated with daisy chain acknowledge structure be- 
cause in each device INTFA signal has to pass through several 
gate delays. ¥ : 


DG tunetc 


(NTERNUPT 
ACKNOWLEDGE 


Figure 2. Single Request, Daisy Chain Acknowledge. 


Multiple Request 


This structure features one line per priority level (see Figure 3). 
The multipie line structure gives the fastest response time since 
the interrupting device can be identitied immediately. it also re- 
sults in simpler interfaces in the peripheral units, in general, a 
single interrupt request flip-flop. This structure allows for the 
possibility of having a mask bit associated with each priority 
level (device). The trade-olf of this circuit is a wider bus and a 
hmit of one penpheral per priorrty level. 


Multiple Request, Daisy Chain Acknowledge 


‘This structure combines the Single Request/Caisy Chain 


Acknowledge with the Multiple Request structure (see Figure 4). 
For each interrupt request line there is an interrupt acknowledge 
lne which is connected to a string of devices in a daisy chain 
fashion. When the appropriate device receives the interrupt 
acknowledge, tt puts an 1.0. number on the data bus. 


The advantage of this structure Is that a lot (more than available 
interrupt levels) of devices may be handled by breaking them up 


- Ra INTERRUPT 


‘1. CPU 
INTERRUPT 
VECTOR 


into short daisy chains. This gives a shorter access time than a 
Pure daisy chain with less hardware than an interrupt request 
line per device. This advantage is that each device must be 
intelligent to pass on the acknowledge signal which requires 
more hardware in each device. 


PRIORITY SCHEMES 


When handling asynchronous requests one must assume that 
sometimes two or more requests can happen simultaneously. In 
order to handle this situation, there must be some sort of priority 
scheme implemented to pick which request is serviced first. 


The two most common priority schemes are the static and the 
rotating structures. In the static structure, ail the interrupt levels 
are ordered from the lowest priority to the highest priority. This 
can be fixed in sofiware or hardware and is usually permanent. 


In the rotating structure the possible interrupt requests are as- 
ranged in a circle. There ts a pointer which points to the lowest 
Priority interrupt. The priority of each interrupt increases as one 
travels around the circle, with the highest priority interrupt being 


INTERRUPT 
LEVEL 
Cal 


Figure 4. Multiple Requests, Dalsy Chain Acknowledge. 


209 


————— Se 


€ fer ane 


210 


adjacent to the lowest priority interrupt. The lowest priority inter- 
Tupt pointer is changed to point at the interrupt that was just 
serviced. This structure is advantageous when all interrupts 
have similar pnority and service bandwidth requirements. 


NESTING 


Nesting allows only higher pnority interrupts to interrupt a pro- 
cessing interrupt service routine. Nesting requires fencing olf 
equal and lower level interrupts. Fencing requires that the inter- 
rupt structure hold the value of the highest priority interrupt being 
serviced. This can be implemented with a Status Register that 
hoids the value as a binary encoded number or in other systems 
as an In-Service Register with a different bit associated with 
each interrupt. 


Whether nesting is performed in microcode or not, all computers 
must have machine instructions to enable and disable interrupts 


and set and clear mask bits. With these instructions, Interrupt 
handlers can be written to accomplish nesting of interrupts al 
though fess efficiently than when done with microcode and 
hardware. in low-end computers, the interrupt structure only 
Prionitizes interrupts leaving nesting to the software interrupt 
handlers. 


A UNIVERSAL HARDWARE INTERRUPT STRUCTURE 


While designing a hardware interrupt structure, the designer 

should consider the specific functions that are to be achieved. 

This provides for systom optimization in not only hardware but 

also software. in-the folowing-paragraphsis a step by- step 

development of a general purpose interrupt structure as related. 
to the-design concepts invoived. 

Multiple Interrupt Request Handling 


Since interrupt requests are generated from a number of 
sources, the interrupt structures ability to handle interrupt re- 
quests from several sources is important. 


As implemented in Figure 5, the register configuration allows the 
hardware to handle interrupt requests trom several sources. The 
first column of registers catches the asynchronous interrupt re- 
quest. The second column of registers synchronizes the re- 
quests-with respect to the system. After the interrupt is serviced, 
one of the CLR lines can be used to selectively clear the inter- 
tupt request. 


Interrupt Request Prioritization 


Since the processor can service only one interrupt request at a 
time, the interrupt structure should have the ability to priontize 
the requests and determine which has the highest priority. As 
shown In Figure 6, a priority encoder can be put on the output of 
the interrupt storage registers. The priority encoder will identify 
the highest Interrupt request as a binary encoded number. 


Dynamic Interrupt Request Masking 


The ability to selectively inhibit or “mask” Individual interrupt 
requests under program control is desirable. For example at 
times it may be important to inhibit all interrupts except Power 
Failure. As shown in Figure 7 this is realized by ANDing the 
output of a mask register with the output of the interrupt storage 
registers. Therelore, the mask register can be used to select 
which interrupt requests will pass through to the rest of the 
hardware. 


Interrupt Request Clearing 3 


Flexibility in te method of clearing the interrupt allows different 
modes of interrupt system operation. Of particular value are the 
abilities to clear the interrupt currently being serviced or clear all 


‘Interrupts. 


Figure 6. 
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This is implemented in Figure 8 by use of the Vector Hold reg- 
ister on the output of the Priority Encoder. This register holds the 
latest interrupt request that was recognized. Before another 
interrupt request is recognized, the output of the Vector Hold 
registe: can be fed through some clear control fogic to selec- 
tively clear the old interrupt. —_ 


Interrupt Request Priority Threshold 


The ability to establish a priority threshold is vatuable. In this 
type of operation, onty those interrupt requests which have 
higher priority than a specified threshold priority are accepted. 
The threshold priority can be defined by microprogram or can be 
automatically established by hardware at the interrupt currently 
being serviced plus one. This automatic threshold prevents mul- 
tple interrupts from the same source. —- 


f- 


PRIORITY 
ENCODER 


Figure 7. 


This feature is implemented in Figure 8 using an incrementer 


and status register which is compared with the current request. - 


Each time an interrupt is recognized, the status register is up- 
dated with one plus the current level. 


Interrupt Service Routine “Nesting” 


This feature allows an interrupt service routine for a given pri- 
Ority request to be interrupted in tum by a higher priority interrupt 
request. This can be achieved by saving the status register be- 
fore each interrupt is serviced and restoring it afterwards. 


Microprogrammability and Hardware Modularity . 


These last two design concepts bring us to the Vectored Priority 
Interrupt controller, the Am2914. The Am2914 is a modular inter- 
rupt system block which Is beneficial in two ways. First, 
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Figure 9. Am2914 Block Diagram. ve thoy ITI L1G! 


hardware modularity provides expansion capability. Additional 
modules may be added as the need to service additional re- 
quests arises. Secondly, hardware modularity provides a struc- 
tural regularity which simplifies the system structure and also 
reduces the number of hardware part numbers. —_ 


The Am2914 is microprogrammable, which permits the con- 
struction of a general purpose or “universal” interrupt structure 
which can be microprogrammed to meet a specific application's 
requirement. The universality of the structure allows standardi- 
zation of the hardware and amonization of the hardware de- 
velopment costs across a much broader user base. The end 
result is a flexible, low cost interrupt structure as showr in 
Figure 9. 


PROGRAMMING THE Am2s14_ 


The Am2914 is centrolled by a four-bit microinstruction field Ig-I5. 
The microinstruction is executed if jE (Instruction Enable) is 
LOW and is ignored if IE is HIGH, allowing the four | bits to be 
shared with other functions. Sixteen different microinstructions 
are executed. Figure 11 shows the peiminsusiche and the 


microinstruction codes. : tas teres ai 


In this microinstruction set, the Master Clear microinstruction is 
selected as binary zero so that during a power-up sequence, the 
microinstruction register in the microprogram control unit of the 
central processor can be cleared to all zeros. Thus, on the next 
clock cycle, the Am2914 will execute the Master Clear function. 
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Figure 10. Am2914 Logic Symbol. 
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‘| MICROINSTRUCTION 
DESCRIPTION 

MASTER CLEAR 
CLEAR ALL INTERRUPTS 


CLEAR INTERRUPTS FROM , 
M-BUS “3 ‘onit 


' CLEAR INTERRUPTS FROM MASK 
REGISTER ju 


CLEAR INTERRUPT, LAST, 
VECTOR READ 


READ VECTOR : i BS 
READ STATUS REGISTER 
READ MASK REGISTER - 
SET MASK REGISTER 
LOAD STATUS REGISTER - “— - 
BIT CLEAR MASK REGISTER 

BIT SET MASK REGISTER 

CLEAR MASK REGISTER 
DISABLE INTERRUPT REQUEST 
LOAD MASK REGISTER 
ENABLE INTERRUPT REQUEST 


Figure 11. Am2914 Microinstruction Set 


. ' 
This includes clearing the Interrupt Latches and Register as well 
as the Mask Register and Status Register. The LGE flip-flop of 
the least significant group is set LOW because the Group Ad- 
vance Receive input is tied LOW. All other Group Advance Re- 
ceive inputs are tied to Group Advance Send outputs and these 
are forced HIGH dunng this instruction. This clear Instruction 
also sets the Interrupt Request Enable flip-flop so ihat a fully 
interrupt dnven system can be easily initiated from any interrupt. 


The Clear All interrupts microinstruction clears the Interrupt 
Latches and Register. : = 


The Clear Interrupts from M-Bus microinstruction clears those 
Interrupt Latches and Register bits which have corresponding 
M-Bus bits set equal to one. 


The Clear Interrupts from Mask Register microinstruction clears 
those Interrupt Latches and Register bits which have cor- 
responding Mask Register bits set equal to one. The M-Bus is 
used by the Am2914 during the execution of this microinstruction 
and must be floating. 


The Clear Interrupt, Last Vector Read micromstruction clears 
the Interrupt Latch and Register bit associated with the last 
vector read. T 


The Read Vector microinsiniction ts used to read the vector 
value of the highest priority request causing the interrupt. The 
vector outputs are three-state drivers that are enabled onto the 
is instruction. This microinstruction also automatically loads the 
value “vector plus one" into the Status Register. in addition, this 
instruction sets the Vector Clear Enable flip-flop and loads the 
current vector value Into the Vector Hold Register so that this 
value can be used by the Clear Interrupt, Last Vector Read 
microinstruction. This allows the user to read the vecior as- 
sociated with the interrupt. and at some fater time clear the 


Interrupt Latch and Register bit associated with the vector read. ~ 


During the Read Status Register microinstruction, the Status 
Register outputs are enabted onto the Status Bus (So-S2). The 
Status Bus 1s a three-bit, bi-direct:onal, three-state bus. 


The Read Mask Register microinstruction enables the Mask 
Register outputs onto the bi-directional, three-state M-Bus. 


The Set Mask Register microinstruction sets all the bits in 
the Mask Register to one. This results in all jnterrupis being 
inhibited. 


The Load Status Register microinstruction loads S-Bus data into 
the Status Register and also loads the LGE flip-flop from the 
Group Enabie inpuL 


The Bit Clear Mask Register mizroinstruction may be used to 
Selectively clear individual Mask Register bits. This micro- 
instruction clears those Mask Register bits which nave cor- 
responding M-Bus bits equal to one. Mask Register bits with 
corresponding M-Bus bits equal to zero are not affectad. 


The Git Set Mask Register microinstruction sets those Mask 
Register bits which have corresponcing M-Bus tits equal to one. 
Other Mask Register bits are not affected. 


The entire Mask Register is cleared by the Clear Mask Register 
microinstruction. Ths enables ali interrupts sut;ect lo the Inter- 
rupt Enable ihp- flop and the Status Register. ‘ : 


All Interrupt Requests may be disabled by execution of the Dis: 
aole Interrupt Request mucroinstruction. This microinstruction 
resets an Interrupt Request Enable flip-flop on the chip. 


The Load Mask Register microinstruction loads data from the 
three-state, bi-directional M-Bus-into the Mask Register. 


The Enable interrupt Request microinstruction sets the Interrupt 
Enable thp-flop. Thus, Interrupt Requests are enabled subject to 
the contents of the Mask and Status Registers. | 
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Am2914 BLOCK DIAGRAM DESCRIPTION 


The Am2914 biock diagram is shown in Figure 9. The Micro- 
instruction Decode circuitry decodes the Interrupt Microinstruc- 
tions and generates required control signals for the chip. 


The Interrupt Register holds the Interrupt Inputs and is an 
eight-bit, edge-tnggered register which is set on the rising edge 
ot the CP Clock signal if the Interrupt Input is LOW. 


The Interrupt latches are setreset latches. When the Latch 
Bypass signal is LOW. the latches are enabled and act as nega- 
tive pulse catchers on the inputs to the Interrupt Register. When 
the Latch Bypass signal is HIGH, the Interrupt latches are 
transparent. 


The Mask Register holds the eight mask bits associated with the 
2ight interrupt levels. The register may be loaded from or read to 
the M-Bus. Also, the entire register or individual mask bits may 
be set or cleared. 


The Interrupt Detect circuitry detects the presence of any un- 
masked Interrupt Input. The eight-input Priority Encoder deter- 
mines the highast priority, non-masked Interrupt Input and forms 
a binary coded interrupt vector. Following a Vector Read, the 
three-bit Vector Hold Register holds the binary coded inter- 
tupt vector. This stored vector can be used later for clearing 
interrupts. 


The three-bil Status Register holds the status bits and may be 
loaded from or read to the S-Bus. During a Vector Read. the 
Incrementer increments the interrupt vector by one, and the re- 
sull is clocked into the Status Register. Thus, the Status Reg- 
ister points to a level one areater than the vector just read. 
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The three-bit Comparator compares the Interrupt Vector with the 
contents of the Status Register and indicates if the Interrupt 
Vector is greater than or equal to the contents of the Status 
Register. 


The Lowest Group Enabled Flip-Flop is used when a number of 
Am2914's are cascaded. In a cascaded system, only one Low- 
est Group Enabled Flip-Flop is LOW at a time. It indicates the 
eight interrupt group, which contains the lowest priority interrupt 
level which will be accepted ard is used to form the higher order 
Status bits. 


The Interrupt Request and Group Enable logic contain various 
gating to generate the Interrupt Request, Parallel Disable, Rip- 
ple Disable, and Group Advance Send signals. 


The Status Overflow signal is used to disable all interrupts. it 
indicates the highest priority interrupt vector has been read and 
the Status Register has overilowed. 


The Clear Control logic generates the eight individual clear sig- 
nals for the bits in the Interrupt Latches and Register. The Vector 
Clear Enable Flip-Flop indicates if the last vector read was from 
this chip. When it is set it enables the Clear Control Logic. 


The CP clock signal is used to clock the Interrupt Register, Mask 
Register, Status Register, Vector Hold Register, and the Lowest 
Group Enabled, Vector Clear Enable and Status Overtiow ica 
Flops, all on the clock LOW-to-HIGH transition. 


CASCADING THE Am2914 


A number of input/output signals are Srewiciod for cascading the 
Am2914 Vectored Prionty Interrupt Encoder. A definition of 
these V/O signals and their required connections foliows: 


Group Signal (GS) — This signal is the output of the Lowest 
Group Enabled flip-flop and during a Read Status micro- 
instruction is used to generate tie high order bits of the Status 
word. 


Group Enable (GE) — This signal is one of the inputs to the 
Lowest Group Enable flip-flop and is used to load the flip-flop 
durnng the Load Status microinstruction. 


Group Advance Send (GAS) — During a Read Vector micro- 
instruction, this output signal is LOW when the highest priority 
vector (vector seven) of the group is being read. In a cascaded 
System Group Advance Send must be tied to the Group Ad- 
vance Receive input of the next higher group in order to transfer 
status information. 


Group Advance Receive (GAR) - During a Master Clear or 
Read Vector microinstruction, this input signal is used with other 
intemal signals to load‘the Lowest Group Enabled Ilip-flop. The 
Group Advance Receive input of the lowest priority group must 
be tied to ground. 


Status Overflow (SV) - This output signal becomes LOW after 
the highest prionty vector (vector seven) of the group has been 
read and indicates the Status Register has overflowed. It stays 
LOW until a Master Clear or Load Status microinstruction is 
executed. The Status Overflow output of the highest priority 
group should be connected to the interrupt Disable input of the 
Same group and serves to disable all interrupts until new status 
is loaded or the system is master cleared. The Status Overflow 
outputs of lower pnonty groups should be left open (see Fig- 
ure 14), 

Interrupt Disable (ID) - When LOW, this input signal inhibits the 
Interrupt Request output from the chip and also generates a 
Rippie Disable output. 


Ripple Disable (RD) — This output signal is used only in the 
Ripple Cascade Mode (see below). The Ripple Disable output is 
LOW when the interrupt Disable input is LOW, the Lowest 
Group Enabled flip-flop is LOW, or an interrupt Request is gen- 
erated in the group. in the nipple cascade mode, the Ripple 
Disable output Is tied to the Interrupt Disable input of the next 
lower priority group (see Figure 13). 


Parallel Disable (PD) — This output is used only in the parallel 
cascade mode (see below). It is LOW when the Lowest Group 
Enabled flip-fiop is LOW or an Interrupt Request is gengrated i in 
the group. It is not affected by the Interrupt Disable input. 


CASCADING CONFIGURATIONS 


Asingle Am2914 chip may be used to prioritize and encode up to 
eight interrupt inputs. Figure 12 shows how the above cascade 
lines should be connected in such a single chip system. 


Figure 12. Cascade Lines Connection for 
Single Chip System. 


The Group Advance Receive and Group Enable inputs should 
be connected to ground so that the Lowest Group Enabled Ifip- 
flop is forced LOW during a Mdster Clear or Load Status micro- 
instruction. Status Overflow should be connected to Interrupt 
Disable in order to disable interrupts when vector seven Is read. 
The Group Advance Send, Ripple Disable, Group itp and 
Parallel Disable pins should be teft open. 


The Am2914 may be cascaded i in either a Ripple Cascade Mode 
or a Parallel Cascade Mode. tn the Ripple Cascade Mode, the 
Interrupt Disable signal, which disables lower priority interrupts, 
is allowed to ripple through lower priority groups. Figures 13, 16, 
and 17 show the cascade connections required for a ripple cas- 
cade 32 input interrupt system. 


In the parallel cascade mode, a parallel! lookahead scheme is 
employed using the high-speed Am2902 Lookahead Carry 
Generator. Figures 14, 15, and 17 show the cascade‘connections 
required for a parallel cascade 32-input interrupt system. For this 
application, the Am2902 is used as.a lookahead interrupt disable 
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Figure 13. Interrupt Disable Connections for 
Ripple Cascade Mode. 


generator. A Paraltel Disable output from any group results in the 
disabling of all lower priority groups in parallel. Figure 15 shows 
the Am2902 logic diagram and equations. 


In Figures 16 and 17 the Am2913 Priority interrupt Expander is 
shown forming the high order bits of the vector and status, re- 
spectively. The Am2913 is an eight-line to three-line priority en- 
coder with three-state outputs which are enabled by the five 
output control signals Gt, G2, G3, G4, and G5. In Figure 16, the 
Am2913 is connected so that its outputs are enabled during a 
Read Vector instruction, and in Figure 17 the Am2913 is con- 
nected to microinstruction bits so that its outputs are enabled 
during a Read Status Instruction. The Am2913 logic diagram and 
truth table are shown in Figure 18. 


The Am25LS138 three-line to eight-line Decoder also is shown in 
Figure 17. It is used to decode the three high order status bits 
during a Load Status instruction. The Am25LS138 logic diagram 
and truth table are shown in Figure 19. 


Am2914 IN THE Am2900 SYSTEM 


The block diagram of Figure 20 shows a typical 16-bit mini- 
computer architecture. The Am2914 is the heart of the Interrupt 
Control Unit as shown at the bottom of the block diagram. It 
receives its microinstructions from the Computer Control Unit. 
The mask, Status and Interrupt vector information are passed on 
the data bus. The interrupt request line from the Am2914 input 
into the next microprogram Address Control unit where it can be 
tested to determine if an interrupt request has been made. 


Figures 21 and 22 show the detailed hardware design of two 
example interrupt control units (ICU's) for an Am2900 Computer 


Figure 14. interrupt Disable Connections for Parallel Cascade Mode. 
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Figure 15. Am2902 Carry Look-Ahead Generator Logic Diagram and Equations. 


Figure 16. Vector Connections for both the Parallel and Ripple Cascade Modes. 


System. Figure 21 shows an eight interrupt level ICU, and Figure 
22 shows an ICU which has sixteen levels. In both designs, the 
Am2914 Instruction inputs and Instruction Enable input are driven 
by the Ip.5 field and IE bit, respectively, of the Microinstruction 
Register. Note that Am2914 Instruction inputs are enabled only 
when the JE bit Is LOW. Therefore, the Ig. field of the Micro- 
instruction Register may be shared with another functional unit 
of the computer such as the ALU. 


The Latch Bypass input is’ shown connected to ground so that a 
Low-going pulse will be detected at any of the Interrupt inputs. 
The designer has the option of connecting the Latch Bypass input 
to a pull up resistor connected to +5 volts. This makes the inputs 
low level sensitive. They are clocked in by each system clock. Itis 
therefore implied that the processor will have to acknowledge the 
interrupt so that the interrupting device will know when to release 
the interrupt requést line. 


Figura 17. Group Signal, Group Enable, Group Advance Send, Group Advance Receive and Status Connections for Both 
the Parallel and Ripple Cascade Modes. 
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Figure 18. Am2913 P ority Interrupt Expander Logic Diagram and Truth Table. 
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Figure 20. A Generalized Computer Architecture. 
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Figure 22. 16 Level interrupt Control Unit for Am2900 System. 
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In Figures 21 and 22, the Status and Mask inputs/outputs are 
connected to the data bus in a bi-directional configuration so that 
Status and Mask Registers may be loaded from or read to the 
data bus with appropriate Am2914 instructions. This gives the 
designer two possibilities which could be very advantageous. 


Number one is the ability to store the Status and Mask information 
on a stack in memory. This is very advantageous when doing 
nested interrupts. Secondly, it allows the designer to construct 
machine instruction that can modify these two registers. This is 
very important to the system programmer who Is involved in 
writing software to manage the interrupts. 


For the eight level ICU of Figure 21, the Status Overflow output is 
connected to the Interrupt Disable input, and the Group Advance 
Receive and Group Enable inputs are connected to ground, as 
previously described. 


For the 16 interrupt level ICU of Figure 22, the Parallel Disable 
output of the higher priority group serves as the high order vector 
bit. An Am2913 Priority Interrupt Expander is gated by the 
Am2914 instruction lines so that its output is enabled onty during a 
Read Status instruction, and is used to encode the high order bit 
of the status. An inverter sutfices to decode the high order bit of 
the status bit during a Load Status Instruction. As described 
previously for a npple cascade system, the Group Advance Re- 
ceive input of the next higher priority group; the Ripple Disable 
output is connected to the Interrupt Disable input of the next lower 
Prionty group; the Status Overtiow output of the highest priority 
group is connected to the Interrupt Disable input of the same 
group, and the Group Advance Receive input of the lowest priority 
group is connected to ground. 


In both designs, two Am29751 32-word by 8-bit PROM's with 
three-state outputs are used to map the Am2914 Vector outputs 
into a 16-bit address vector. The PROM outputs are connected to 
the data bus. When a Read Vector Instruction (Am2914) is exe- 
cuted, ihe address vector is available to be used either as the 
address of the next instruction or a location to find the address of 
the next instruction to execute. 5 


Figure 23 shows a design where the address vector from the 
Mapping PROM can be clocked into a register in the Am2903's. 
The registers in the Am2903's would be split between general 
purpose, scratch, stack pointers and Program Counter registers. 


The address vector also may be gated directly to the “D" inputs of 
the Am2911 Microprogram Sequencer as shown in Figure 24, 
and used as the start PROM address of a microinstruction inter- 
rupt service routine. This method would be most useful in a 
controller application. This method would trade faster service for 
a bigger microprogram that accommodates all the code to service 
each individual interrupt. 


FIRMWARE EXAMPLE FOR Am2914 INTERRUPT SYSTEM 


The software for handling interrupt requests fs on two levels. 
The first level to come into play is the microprogram level. This is 
the level at which the request is recognized and the program 
counter is manipulated to start execution of a machine level 
interrupt service routine which is the second level. When the 
machine level interrupt service rouline is finished, some form of 
a Return interrupt Instruction 1s executed. The microcode for the 
return instruction manipulates the program counter so that 
execution of the current machine program previous to the re- 
quest is restored as shown in Figure 25. 

This example is concemed with the microprogram level. This 
microcode goes along with the hardware shown in Figure 23. In 
this example the code is shown in the form of Flow Charts be- 


cause the actual microprogram format will vary from machine to 
machine. 


The important features to notice that have a direct relevance to 
the firmware are the Latch Bypass and where the Mask, Status 
and Vector busses go. For this example, the Latch Bypass Is 
LOW making the Interrupt Latches latch up on a negative going 
pulse. The Mask and Status busses go to the data bus aliowing 
the Status and Mask data to be transferred to and from memory. 
The Vector bus passes through a mapping PROM to the data 
bus where it can be read Into the Program Counter contained in 
the Am2903's. The PROM contains addresses of service 
routines which correspond to the different interrupt levels. 


Another retevant fact, important to understanding the firmware is 
that the interrupt mechanism is limited to handle interrupts on 
the machine level. 


As shown in Figure 26a, the first thing that happens in the fetch 
routine (written in microcode) is a conditional subroutine call that 
will be taken if an interrupt request is present. This happens 
before the current machine instruction is fetched and the pro- 
gram counter is incremented. 


In the interrupt routine (shown in Figure 26b) a microprogram 
subroutine is first called to push the program counter onto the 
system stack. This is done so that the program counter can be 
restored in order to resume execution of the machine program 
after the interrupt service routine is done. The next thing that is 
saved on the system stack is the contents of the Am2914 Status 
Register. This is done because the status register which contains 
the priority level that would be serviced pnor to the interrupt, will 
be restored after the interruptis serviced. This maintains a nested 
interrupt structure (fence). 


After saving the program counter and status register, the vector 
is read out of the Am2914 through the mapping PROM to obtain 
the address of the machine interrupt service routine. The ad- 
dress is then read into the program counter which resides in the 
Am2903's. When the Vector is read, the interrupt request priority 
plus one is automatically put into the status register by the 
Am2914 so that all interrupt requests of lower priority than the 
one being serviced are ignored. This is often referred to as 
moving the fenca up. Since the vector has been read and the 
new address is in the program counter, the interrupt request can 
be cleared from the interrupt register via the Clear Interrupt/Last 
Vector Read instruction. At this point a jump is made to the 
Fetch routine which will now fetch the first instruction of the 
machine Interrupt Service routine. 


The last instruction that the machine level interrupt service exe- 
cules is an Interrupt Retum. This will in tum call Retum Interrupt 
microprogram. The status is first popped off the system stack 
and loaded back into the status register. This restores the Inter- 
tupt Fence. The program counter is then popped off the system 
stack and loaded into the program counter register. This re- 
stores the program counter to point to the instruction that was 
going to be executed when the interrupt request occurred. 


TIME DELAY WHEN USING THE Am2914 


An aspect that should be covered when using any part is how it 
will fit into the system timing: because the cycle time of the 
system will be as long as the longest delay path in the machine. 
Shown in Figure 27 is the longest delay path through the 
Am2914 for the previous 16-bit computer example. The calcula- 
tions were using both typical and worst case values at 25°C and 
5.0V. ° 


The longest delay path for the system where the vector from the 
mapping PROM feeds into the “D™ inputs of the Am2910'is 


Figure 24, 
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Figure 25. Machine Level Instruction Flow During Interrupt 
Request. 
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Figure 26b. Call Interrupt Service Routine Microprogram 
Flow Chart. 
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Figure 26a. Flow Chart for a Simplified Microprogram Fetch a ee eee 
Routine. Figure 26c. Roturn Interrupt M.croprogram Flow Chari. } 
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Figure 28h. 


shown in Figure 28. This path is much longer because of the two 
PROM's that have to be accessed. Therefore, there may be a 
trade-off of slightly longer system cycle time for faster service of 
interrupts via service routines in microcode. 


For some systems the delay time shown in Figure 28b may be 
too long. Therefore, the designer can split the delay time into 
parts by putting a register between the Am2914 and the mapping 
PROM as shown in Figure 28c. When done in two system ciock 
cycles, the delay time will be as shown in Figure 28f. 


Figure 28d shoWs the delay path from the Interrupt Request 
Register through the Condition Code MUX to the Am2910. The 
time calculations are shown in Figure 28g. Again, for some sys- 
tems, this path may be too long. Therefore, as shown above, this 
path may be broken ‘n two, which is shown in Figure 28e. This 
will result in two system clock cycles. The delay involved in each 
cycle Is shown in Figure 28h. 


ANOTHER EXAMPLE OF Am2900 SYSTEM 
USING THE Am2914 


As shown in Figure 29, this example varies in the way that the 
interrupt request is recognized by the microprogrammed 


machine. In this example the interrupt request line for the 
Am2914 enables or disables the MAP signal going to the map- 
ping PROM. When an interrupt request is present and a Jump 
Map instruction is executed, the output of the mapping PROM 
remains tri-stated; and the bus connected to the “D" inputs of 
the Am2910 is HIGH because of the pull-up resistors. Theretore, 
the microprogram will start executing at the highest location in 
microprogram memory when an interrupt request is present. At 
this location a Jump Instruction to the microprogram interrupt 
service routine could be placed. The microcode is wntten so that 
the only time a Jump Map instruction is executed is at the end of 
the Fetch microprogram routine as shown in Figure 30a. 


In the previous example the interrupt request was recognized 
before the program counter is incremented after which the Jump 
Map instruction is executed. Wnen the Jump Map is executed, 
either the instruction Is executed or an interrupt request is ser- 
viced. Therefore, when the Retum Interrupt machine instruction 
Is executed, the program counter needs to be backed up via 
microcode, as shown in Figure 30b, in order to refetch the 
machine instruction which was fost. This also dictates that the 
program counter have a path to an incrementer/decrementer or, 
ALU, which in this example is handled by puttng the pregearn 
counter in the Am2903's. 


MICROPROGRAM LEVEL INTERRUPT EXAMPLE 


Some high-speed contro! applications require extremely fast 
interrupt response. While it may orcinarily be desirable to com- 
plete an entire processing sequence (such as executing a me 
croprogram for a macroinstruction) prior to testing for the inter- 
rupt and allowing it to occur, it is not ahvays possibie to achieve 
the required interrupt response time desired. If this 1s the case, 
microinstruction level interrupt handling must be employed. The 
technique described below has a maximum latency of three mi- 
crocycles which can be 450-600ns total. Impiementation is 
straightforward using the Am2910 Microsequencer, a 40-pin LS} 
device that can control 4096 words of microprogram at a 150ns 
cycle time, and a few extra MSI and SSI packages. In this appli- 
cation, the Am2910 is configured in tts standard architecture. 
The additional logic does not influence the normal system cycie 
time. 


It microlevel interrupt handling is to be employed, logic must be 
provided to generate a substitute microprogram acdress corre- 
sponding to the location of the interrupt service routine. In the 
event of a microlevei interrupt, the sequencer acdress oulputs, 
are tri-stated and the substitute address is placed on the micro- 
program address bus, Causing the next microinstruction feich to 
be determined by the interrupt control vector generator. While 
this is happening, steps must be taken with the Am2910 to in- 
sure that the interrupted routine can be properly restored. To 
understand this procedure, it will be necessary to examine the 
Am2910 in more detail. 


Referring to Figure 31, the microprogram address bus is dnven 
by the Y¥ outputs of the Am2910 through a ti-state butter than 
can be disabled by means of the OE input. The address ts 
selected in a multiplexer from a direct input, from a register 
counter, from a push/pop stack, or from a microprogram counter 
register. The microprogram counter register is commonly used 
as the address source when executing the next microinstrucuon 
in sequence. Whenever an address appears at the mulupiexer 
outputs, it is incremented and presented to the microprogram 
counters inputs. At the nsing edge of the cic, this new address 
that is current address-plus-1 is joaded into the microprogram 
counter and a microprogram access begins at this address. 
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Figure 30a. Return Interrupt Microprogram 
for Second Example. 


REGISTER! J] 
COUNTER 
R 


> AM 


MICROPROG! 
COUNTER- 
REGISTER O° 


3 
= 
o 
Ee 
Go 
=z 
a 
< 
is} 
w 
a 


SELECT/CLEAR 


PUSH/POP/HOLO/CLEAR 


: > 12-BIT DATA PATH 


CONTROL PATH 


Figure 31. Am2910 Block Diagram. 
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Note that at this time, whatever was fetched at the previous’ 
address was loaded into the microword register for execution. 

Thus, the microprogram sequencer is always looking for the 
address of the next microinstruction to be executed (while a 
Previously fetched microinstruction is residing in the microword 
register). Subroutine and microprogram foops may be ac- 
complished by using the stack and the register counter. Ae- 

gardiess of what is selected as source of next address, the 

selected address will be incremented and presented to the 

microprogram counter. So to accomplish a microprogram 
branch, one would simply setect the D inputs for a branch ad- 

dress for one cycle, then the next address source could be 

Switched back to the program counter on the next cycie which 

would then contain the branch address plus 1. 


This is a carry input to the incrementer which is normally tied 
HIGH. In the case of a microlevel interrupt, the microprogram 
sequencer will not determine the address of the next microin- 
struction to be executed. Instead the sequencer output will be 
tn-stated and a substitute address will be placed on the bus. The 
sequencer continues to operate in a normal fashion with its mul- 
tip!exer output being incremented and presented to the micro- 
Program counter reaister. It must now be noted that the instruc- 
tion located at the address then coming out of the muluplexer 
Outputs will not be execuied but rather the next microinstruction 
to be executed will be determined by the interrupt vector 
generator. It would therefore, be wrong to increment this micro- 
Program address but rather it must be saved intact in order to 
push it onto the stack for access during interrupt retum. This is 
easity accomplished in the Am2910 by grounding the carry input 
to the incrementer simultaneously with three-stating the se- 
quencer output. Then the mulliplexer output will be stored in the 
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microprogram counter register and on the next microcycie the 
Am2910 must be told to push in order to preserve this address 
on the stack. 


This carry-in input is all important and exists on all Advanced 
Micro Devices’ microprogram sequencers. Unless the carry-in is 
grounded, whatever address was in the multiplexer output when 
the sequencer output was tri-stated Is Incremented and an in- 
struction is missed in the interrupted routine. This, of course, 
would likely be disastrous. The key to this microinterrupt 
technique is that the address of the unexecuted instruction 
(when the Am2910 was (ri-stated and a substitute address 
supplied) is preserved by inhibiting the increment via the cany 
input, so the address is passed on intact to the microprogram 
counter. If the microinterrupt is to be more than one cycle long, 
the microprogram counter must be pushed so as to save the 
retum address. Otherwise, @ “continue” may be used to retum 
from the interrupt on the very next cycle. in this event the mic- 
rointerrupt effectively inserts one instruction in the stream. 


Figure 32 is the block diagram of a hardware design that imple- 
ments the above concept. The SYNC;/CONTROL and INTER- 
RUPT CONTROL/VECTOR GENERATOR logic are shown in 
detail in Figure 33. Part of the Am2918 and both ‘LS74 Flip- 
Flops are used to synchronize the recognition of the asynchron- 
ous interrupt request as shown in Figure 34. The interrupt re- 
quest arrives at the interrupt input. On the next clock cycle itis 
clocked into the Am2918. In the following clock cycle a pulse 
that is one system clock cycle tong is put out by the flip-flop par 
FF1 and FF2. The puise is used to disable the carry input o! the 
Am2910, tri-state the output of the Am2910, and enable the 
jump vector onto the input of the PROM. The vector indexes into 
a table in microprogram memory that contains "JUMP SUB- 
ROUTINE” instructions to different interrupt service routines. 
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Figure 32. Computer Control Unit Set-up for High-Speed Micro-Leve! Interrupt Handling. Latency is a Maximum 


of Two Microcycles (i.e., about 300 to 500ns). . 


Figure 33. Example of Sync Control Logic and Vector Generator. 


Figure 34. Timing of Vector Generator and Sync Control Logic. 
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“This is a JSB instruction, but observe that the retum address will be the yel-to-be executed location A+3. 
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Figure 35. Interrupt Sequence Timing. 
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Figure 36. Return-From-interrupt Sequence Timing. 
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Figure 35 shows how the interrupt sequence timing fits Into the 
normal flow of microprogram address in the Am2910. Note how 
the stack is used. This demonstrates the need for always re- 
serving room on the stack to allow for interrupts. This applies to 
any room that the Interrupt service routine may requiro as well 
as the retum address. This limitation may require that only one 
Interrupt request be serviced at a time. 


Figure 36 shows how the return from the Interrupt service 
routine fits into the microprogram flow. Notice that a Retum 
instruction is used to accomplish this. 


SUMMARY 


In this chapter, interrupts were discussed beginning with a def- 
inition of the Interrupt Mechanism and proceeding to a classifi- 
cation of different interrupts and how they are handled. A dis- 


cussion of the concepts that go into designing the “Universal 
Interrupt” hardware was given which culminated with the 
Am2914. The chapter ends with several Interrupt Mechanism 
applications using the Am2914 and Am2910. 


In this chapter it was shown how interrupts can be handled using 
parts from the Am2900 family. Because of their hardware mod- 
ularity and universal architecture, they may be used in a variety 
of applications. Since the Am2900 Family parts are micropro- 
grammabie, they allow the user's system lo grow with time as 
system requirements change. Together these attributes make 
the Am2900 Family the flexible cost effective family that it is. 
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Introduction 


The transfer of data between the microcomputer and the 
peripheral devices is generally reterred to as Input/Output (/O). 
What is desired Is a high speed technique of transferring data 
between the peripherals and the memory. Generally speaking, 
there is a minimum of three types of /O. These are, Programmed 
YO, Memory Mapped VO and Direct Memory Access I/O. All of 
these schemes are common in today’s currently available 
minicomputers. A basic understanding of these I/O techniques is 
helpful in fully comprehending DMA. The first two of these types 
of /O can be interrupt driven. Thatis, programmed I/O or memory 
mapped VO can be initiated by an interrupt from the peripheral 
device. 


Programmed UO 


In this type of VO, all operations are controlled by the CPU 
program. In other words, the peripheral device performs the 
functions of inputting or outputting data as it is controlled by the 
CPU. Normally, the machine wiil include a set of I/O instructions 
which are used to transfer data to or from the peripheral devices 
via an InpuvOutput port. All data for the peripheral devices 
Passes through these /O ports to the CPU and the resources of 
the CPU must be utilized in order to effect an /O transfer. Figure 1 
shows the Block Diagram of a programmed I/O system used ina 
typ:cal microcomputer. Figure 2 shows an example of that portion 
of the program used to outpul data to the peripheral device. 


LO PORT 


MPR-SS0 


Figure 1. Programmed W/O System. 


CPU Program 


Load R, M Load CPU Register R with the Contents of 


Memory Address M 


Transfer the Contents of CPU Register R to 
VO Device D via the V/O port. 


Out D, R 


Figure 2. Example Output Program — Programmed I/O. 


Programmed V/O is simple to implement and does not require the 
utilization of any memory addresses for its realization. In addition, 
speciai instructions are available to the programmer to execute 
the peripheral data transfers. Programmed UO is also low cost 
relative to other types of I/O; however, it has the following disad- 
vantages. Since I/O device operation is asynchronous with re- 


spect to CPU operation, the CPU has no way of knowing when a 
peripheral device is ready to transfer data and must periodically 
poll the device to determine its readiness. This results in an 
inefficient i/O transfer. Also, since the CPU must be used to effect 
the I/O transfer, the CPU resources are tied up during the time of 
transfer and the time of polling and cannot be used for other tasks. 
For these reasons, Programmed UO is generally limited to use 
with low speed devices. 


Perhaps, one of the best known programmed I/O microcom- 
puters in the industry today is the AM9080A. This device features 
two instructions for either inputung data or outputting data to any 
one of 256 Input/Output ports. 


Memory Mapped VO 


Memory Mapped I/O is a technique whereby the transfer of data 
to and from peripheral devices is accomplished by using some of 
the normally available memory space. In this technique, memory 
addresses are decoded within the peripheral devices and are 
thus used to determine when a specific device is being ad- 
dressed. Usually, each type of function within the peripheral 
device is assigned a memory address and can then be accessed 
by the CPU. For example, the pernpheral device may contain a 
command register, a status register, a data in register and a data 
out register. Thus, four memory addresses might be utilized in 
performing /O to this peripheral. Figure 1 is also the block dia- 
gram for a Memory Mapped VO scheme. 


The chief advantage of Memory Mapped I/O is that all of the 
memory reference instructions are usually available to perform 
the YO function. Consequently, no special V/O instructions are 
required in the machine. The key disadvantage of this technique 
is that a block of the memory addressing range must be set aside 
for assignment to the peripheral devices. Thus, the overall mem- 
ory addressing range of the machine is reduced by the size of this 
block. Again, the resources of the CPU are tied up while the I/O is 
being performed. A well known machine using only Memory 
Mapped I/O is the PDP 11. In it the upper 4k of memory space is 
usually used for the /O devices. 


Interrupt Driven YO 


Interrupts are means by which a peripheral device can stop the 
normal flow of the CPU instruction execution and force the CPU to 
temporarily suspend its current program. Then, the program 
“jumps” to a different program which executes an V/O transfer. 
Typically, this eliminates the need for polling the peripheral de- 
vices to determine if an I/O transfer is ready. Thus, the interrupt 
driven scheme provides a more efficient /O transfer technique. 
However, there is an overhead burden associated with interrupts 
in that the CPU must store away and later restore all of the 
parameters required to resume the interrupted program. This 
overhead degrades the CPU performance. Depending on the 
overall interrupt structure, the CPU still may have to do some 
polling of devices which may be tied to the same interrupt level. 


It should be pointed out that both Programmed VO and Memory 
Mapped can take advantage of the interrupt technique. Thatis, an 
interrupt can be used to initiate the peripheral data transfer in 
either type of system. The CPU still must control the transfer of 
the data between the memory and the peripheral device and the 
CPU resources are inavailable for executing other instructions 
during this time. 


What is DMA? 


DMA is a technique for data transfer which provides a direct path 
between the V/O device and the memory without CPU :nterven- 
tion. With this path, a peripheral device has “Direct Memory 
Access” and can transfer data directly to or from the memory. The 
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Figure 3. DMA I/O System. 


purpose of the DMA is to relieve the CPU of the task of controlling 


the VO transfe:, thereby freeing it to perform other tasks during 
this time, and to provide a means by which data can be transfer- 
fed between an I/O device and memory at very high speed. 
Figure 3 shows the Block Diagram of a system where several /O 
devices can perform DMA transfers into memory. Note that the 
CPU and peripheral devices share a common bus to the memory 
and that the CPU and peripherai devices cannot access memory 
during the same cycle. DMA can also be designed to perform 
memory-to-memory transters or VO-to-V/O transfers. 

Several DMA transfer methods exist, such as the CPU halt 
method, the memory timeslice method, and the “cycle steal” 
method. In the CPU halt method, the CPU is halted and switched 
off the bus while a OMA transfer occurs. This is the most 
straightforward method. However, it takes a relatively long time to 
switch the CPU on and off the bus, and the CPU cannot do 
anything during the transfer. 


The memory timeslice method works by splitting each memory 
cycle into two timeslots; one is reserved for the CPU and the other 
for DMA. This method provides the highest CPU execution rate 
as well as the highest DMA transfer rate because both the CPU 
and DMA are guaranteed access to memory during every mem- 
ory cycle. The disadvantage of this method is that high speed, 
costly memories must be used. 


The “cycle steal method is a cost/performance compromise 
between the low cost of the CPU halt method and the high 
performance of the memory timeslice method. Cycle stealing 
refers to a DMA device “stealing” a CPU memory cyclein order to 
execute a OMA transfer. CPU program execution continues dur- 
ing the DMA transfer (the CPU is not halted), resulting In an 
overlap of CPU program execution with DMA transfer. If the CPU 
and a DMA device require a memory cycle at the same time, 
Priority is granted to the DMA device and the CPU waits until the 
OMA cycle is completed. OMA causes CPU performance degra- 
dation only in those applications where the CPU uses the entire 
memory bandwidih. In many applications the CPU is slow relative 
to memory cycle time and “cycle stealing” provides satisfactory 
performance at relatively low cost 


How is DMA Implemented? 

In order to relieve the CPU of the V/O transfer control task, circuitry 
extemal to the CPU must be added. This circuitry is called the 
DMA Contoller and performs the foliowing functions. 


MPA-551 


Address Line Control — ina OMA system, the memory address 
lines are driven by either the CPU or a DMA device, depending on 
which is using the memory during a given cycle. The DMA con- 
troller must switch the appropnate address onto the memory 
address lines, 


Data Transter Control — The DMA Controller must provide the 
control signals required to transfer data directly between memory 
and an /O device. As with the address lines, these control signals 
must be switched onto and off of the memory control lines appro- 
priately. 


Address Maintenance — Just as the CPU has the program 
counter and one or more other registers for memory address 
pointers, the DMA controller must also maintain an address 
pointer that indicates where the next word of data will be read or 
written in memory. This pointer must be incremented or dec- 
remented after each word transfer. 


Word Count Maintenance — At the initialization of a DMA 
transfer, the CPU specifies to the OMA Controller the total 
number of words to be transferred. During the transfer, the OMA 


controller must maintain a count of the number of words thathave ~ 


been transferred and terminate the transfer when the specilied 
number of words has been reached. 


Mode Control — Certain aspects of a DMA transfer, such as 
direction of data flow, method of termination, etc., may vary from 
one OMA transfer to the next. For this reason, a number of DMA 
modes may be required. Mode contro! logic contained inthe DMA 
controller, is set by the CPU at the initialization of a DMA transfer. 


A DMA Controller can be placed in each VO device (Distributed 
OMA) or DMA control circuitry for a number of I/O devices can be 
placed in a separate unit (Centralized DMA). The former provides 
the advantage of incremental cost; DMA control circuitry is added 
only as VO devices are added. The latter provides the advantages 
of consolidation. 


At OMA Initialization, the CPU normally specifies the mode, the 
starting memory address and the number of words to be trans- 
ferred (word count) to the DMA controller. In some applications, it 
is desirable to rapeat a DMA transfer over and over again without 
disturbing the CPU. This capability is called Repetitive DMA, and 
can be implemented by adding two registers to the DMA con- 
troller. One register saves the starting address and the other the 
Starting word count. This allows the OMA Controller to automati- 
cally reinitialize itself after the transfer of the data has been 
completed, thereby eliminating the need for CPU intervention. 


The Am2940 OMA ADDRESS GENERATOR 


The design of the Address Line Control, Data Transfer Control 
and Mode Control circuitry of a OMA Controller is dependent 
upon system architecture and timing; therefore, it varies con- 
siderably from system to system. However, the address mainte- 
Nance and word count maintenance circuitry is independent of 
these variables, and is common to almost all OMA Controllers. 
The Am2940 OMA Address Generator is designed for use inDMA 
Controllers and provides the Address and Word Count mainte- 
nance circuitry that is common to most. t combines the advan- 
tages of high speed bipolar LSI with the flexibility and general 
purpose usefulness of microprogrammed control. 


Am2940 GENERAL DESCRIPTION 


The Am2940, a 28-pin member of Advanced Micro Devices 
Am2900 family of Low-Powei Schottky bipolar LS! chips, is a 
high-speed, cascadable, eight-bit wide Direct Memory Access 
Address Generator slice. Any number of Am2940s can be cas- 
caded to form larger addresses. 


The primary function of the device is to generate sequential 
memory addresses for use in the sequential transfer of data to or 
from a memory. tt also maintains a data word count and gener- 
ates a DONE signal when a programmabie terminal count has 
been reached. The device is designed for use in peripheral con- 
trollers with DMA capability or in any other system which transfers 
Gata to or from sequential locations of a memory. 


The Am2940 can be programmed to increment or decrement the 
memory address in any of four control modes, and executes eight 
different instructions. The initial address and word count are 
Saved internally by the Am2940 so that they can be restored later 
in order to repeat the data transfer operation. 


a 


INSTRUCTION 


Am2940 ARCHITECTURE 


As shown in the Block Diagram of Figure 4, the Am2840 consists 

of the following: 

@ A three-bit Control Register. 

@ An eight-bit Address Counter with input muttiplexer. 

@ An eight-bit Address Register. 

@ An eight-bit Word Counter with input multiplexer. 

@ An elght-bit Word Count Register. 

e@ Transfer complete circuitry. 

@ An eight-bit wide data multiplexer with three-state output but- 
fers. 

@ Three-state address output buffers with extemal output enable 
control. 

@ An instruction decoder. 


Control Register 


Under instruction control, the Control Register can be loaded or 
read trom the bidirectional DATA lines O9-07, Control Register 
bits 0 and 1 determine the Am2940 Control Mode, and bit 2 
determines whether the Address Counter increments or dec- 
tements. Figure 5 defines the Control Register format. 


Address Counter 


The Address Counter, which provides the current memory ad- 
dress, is an eight-bit, binary, up/down counter with full look-ahead 
carry generation. The Address Carry Input (Ach) and Address 
Carry Output (ACO) allow cascading to accommodate larger 
addresses. Under instruction control, the Address Counter can 
be enabled, disabled, and loaded from the DATA inputs, Dg-D;, 
or the Address Register. When enabled and the ACI input is 
LOW, the Address Counter increments/decremenis on the LOW 
to HIGH transition of the CLOCK input, CP. The Address Counter 
output can be enabled onto the three-state ADDRESS outputs 
Ao-Az under controt of the Output Enable input, OE,. 


WORD COUNT 
REGISTER 


CONTROL 
WORD COUNTER 


3 INSTRUCTION : 
" 7 2 = : 


Figure 4. Am2940 DMA Address Generator. 
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Figure 5. Control Register Format Definition. 


Address Register 


The eight-dit Address Register saves the initial address so that it 
can be restored later in order to repeat a transfer operation. When 
the LOAD ADDRESS instruction is executed, the Address Reg- 
ister and Address Counter are eimonansously loaded from the 
DATA inputs, Do-Dy. 


Word Counter and Word Count Reg|lster 


The Word Counter and Word Count Register, which maintain and 
Save a word count, are similar in structure and operation to the 
Address Counter and Address Register, with the exception that 
the Word Counter increments in Control Modes 1 and 3, decre- 
ments in Control Mode 0, and is disabled in Control Mode 2. The 
LOAD WORD COUNT instruction simultancously loads the Word 
Counter and Word Count Register. 


Transfer Complete Circultry 


The Transfer Complete Circuitry is a combinational logic network 
which detects the completion of the data transfer operation in 
three Control Modes and generates the DONE output signal. The 
DONE signal is an open-coilector output, which can be dot-anded 
between chips. 


Data Multiplexer 


The Data Multiplexer is an eight-bit wide, 3-input multiplexer 
which allows the Address Counter, Word Counter, and Control 
Register to be read at the DATA lines, Dg-Dy. The Data Multi- 
plexer and three-state Data output buffers are instruction con- 
trolled. 


Address Output Suffers 


The three-state Address Output Buffers allow the Address 
Counter output to be enabled onto the ADDRESS lines, Ag-Ay, 
under external control. When the Output Enable input, OE,, is 
LOW, the Address output butfers are enabled; when O€, is 
HIGH, the ADDRESS lines are in the high-impedance state. The 
address and Data Output Butfers can sink 24mA output current 
over the commercial operating range. 


Instruction Decoder 


The instruction Decoder generates required intemal control sig- 
nals as a function of the INSTRUCTION inputs, !o-lz and Control 
Register bits 0 and 1. 


Clock 


The CLOCK input, CP, is used to clock the Address Register, 
Address Counter, Word Count Register, Word Counter, and 
Control Register, all on the LOW to HIGH transition of the CP 
signal. 


Am2940 CONTROL MODES 


Contro! Mode 0 — Word Count Equals Zaro Mode 


in this mode, the LOAD WORD COUNT instruction toads the 
word count into the Word Count Register and Word Counter. 


When the Word Counter is enabled and the Word Counter. 


Carry-in, WCl, is LOW, the Word Counter decrements on the 
LOW to HIGH transition of the CLOCK input, CP. Figure 5 
specifies when the DONE signal is generated in this mode. 


Control Mode 1 — Word Count Compare Mode. 


In this mode the LOAD WORD COUNT instruction loads the word 
count into the Word Count Register and clears the Word Counter. 
When the Word Counter is enabled and the Word Counter 
Carry-in, WCl, Is LOW, the Word Counter increments on the LOW 
to HiGH transition of the clock input, CP. Figure 5 specifies when 
the DONE signal is generated. 


Control Mode 2 — Address Compare Mode 


In this mode, only an initial and final memory address need be 
specified. The initial Memory Address Is loaded into the Address 
Register and Address Counter and the final memory address is 
loaded into the Word Count Register and Word Counter. The 
Word Counter is always disabled In this mode and serves as @ 
holding register for the final memory address. When the Address 
Counter is enabled and the ACI input is LOW, the Address 
Counter increments or decrements (depending on Control Reg- 
ister bit 2) on the LOW to HIGH transition of the CLOCK Input, CP. 
The Transfer Complete Circuitry compares the Address Counter 
with the Word Counter and generates the DONE signal during the 
last word transfer, i.e., when the Address Counter equals the 
Word Counter. 


Control Mode 3 — Word Counter Carry Out Mode 


For this mode of operation, the user can load the Word Count 
Register and Word Counter with the two's complement of the 
number of data words to be transferred. When the Word Counter 
is enabled and the WCI input is LOW, the Word Counter incre- 
ments on the LOW to HIGH transition of the CLOCK input, CP. A 
Word Counter Carry Out signal, WCO, indicates the last data 
word is being transferred. The DONE signal is not required in this 
mode and, therefore, is aways LOW. 


Am2940 INSTRUCTIONS 


The Am2940 instruction set consists of eight instructions. Six 
instructions load and read the Address Counter, Word Counter 
and Control Register, one instruction enables the Address and 
Word Counters, and one instruction reinitializes the Address and 
Word Counters. The function of the REINITIALIZE COUNTERS, 
LOAD WORD COUNT, and ENABLE COUNTERS instructions 
vary with the Control Mode being utilized. Table 1 defines the 
Am2940 Instructions as a function of Instruction inputs fg-!2 and 
the four Am2940 Control Modes. 


The WRITE CONTROL REGISTER instruction writes DATA 
input Do-D2 into the Control Register; DATA inputs D;-D, are 
“don't care” inputs for this instruction. The READ CONTROL 
REGISTER instruction gates the Control Register outputs to 
DATA lines, Do-D2. DATA lines 03-D; are in the HIGH state 
during this instruction. 


The Word Counter can be read using the READ WORD 
COUNTER instruction, which gates the Word Counter ouputs to 
DATA lines O9-Dy. The LOAD WORD COUNT instruction is 
Control Mode dependent. In Control Modes 0, 2, and 3, DATA 
inputs Dy-Dy are written into both the Word Count Register and 
Word Counter. In Control Mode 1, DATA inputs Dp-D7 are written 
into the Word Count Register and the Word Counter is cleared. 


The READ ADDRESS COUNTER instruction gates the Address 
Counter outputs to DATA lines Dg-D7, and the LOAD ADDRESS 
instruction writes DATA inputs D9-D; into both the Address Reg- 
ister and Address Counter. 


tn Controt Modes 0, 1, and 3, the ENABLE COUNTERS instruc- 
tion enables both the Address and Word Counters; in Control 
Mode 2, the Address Counter is enabled and the Word Counter 
holds its contents. When enabled and the carry inputis active, the 
counters increment on the LOW to HIGH transition of the CLOCK 
input, CP. Thus, with this instruction applied, counting can be 
controlled by the carry inputs. 


The REINITIALIZE COUNTERS instruction also is'Control Mode 
dependent. In Control Modes 0, 2, and 3, the contents of the 
Address Register and Word Count Register are transferred to the 
respective Address Counter and Word Counter; in Control Mede 
1, the content of the Address Register ts transferred to the Ad- 
dress Counter and the Word Counter is cleared. The 
REINITIALIZE COUNTERS instruction allows a data transfer 
operation to be repeated without reloading the address and word 
count from the DATA lines. 


Am2940 Timing 


Various computations must be performed by the designer to 
determine how fast the Am2940 can be operated reliably in a 
given design. The exercises of this section demonstrate how 
these computations are performed. 


Worst case A.C. characteristics, over the full temperature and 
voltage operating range should be used in these computations. 
Since, at the time of this writing, the Am2940 is still being charac- 
terized, only typical A.C. characteristics are available. These 
typicals are used here merely to demonstrate how the computa- 
tions are performed; the designer must use worst-case charac- 
teristics. Figure 6 shows the characteristics of a Schottky register 
and a memory which are assumed for this exercise. 


Figures 7A, B, and C show the typical cycle time calculations for. 


.the 16-bit Am2940 configuration. The typical delay along the 


longest path for any of the eight Am2940 instructions determines 
the typical cycle time. In each case, delays are computed from the 
LOW {to HIGH transition of a clock through an entire microcycle to 
the next LOW to HIGH transition of a clock. The typical cycle time 
for a 16-bit Am2340 configuration is G4ns. 


TABLE 1. Am2940 INSTRUCTIONS 
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Figure 8 shows the address output enable time computations, 
Since the Am2940 has an asynchronous address output enable 
control, the address output enable time may not be related to the 
Am2940 cycle time. 


Schottky Redister Figure 9 shows the typical cycle time calculation for an 8-bit 
Clock to Output Delay Am2940 configuration. The path shown is the longest path and 
Input Set-Up Time . determines an 8-bit typical cycle time of 52ns. 


Memory The typical cycle time calculation for a 24-bit Am2940 configura- 
Address Set-Up Time tion is shown in Figure 10. The path shown is the longest path and 
determines a 24-bit typical cycle time of 76ns. 


Figure 11 is a summary of typical Am2940 cycle times for the 8, 16 
Figure 6. Assumed AC Characteristics. and 24-bit configurations. 
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Figure 7. 16-Bit Typical Cycle Time Computations. 
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Figure 7. 16-Bit Typical Cycle Time Computations. (Cont) 
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Figure 9. 6-Bit Typical Cycle Time Computation. 
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Figure 11. Summary of Am2940 Cycle Times. _ — 
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AN EXAMPLE DESIGN 


The Am2940 is designed for use in high speed peripheral Con- 
trollers using OMA and provides the address and word count 
maintenance circuitry that is common to most. As indicated previ- 
ously, OMA Control can be placed in each /O Controller (Distri- 
buted DMA) or OMA Control for a number of I/O devices can be 
centralized in a separate unit. ° 


Figure 12 shows a block diagram of a microprogrammed W/O 
Controller which is designed for use in a Distributed DMA system. 
The Am2910 Microprogram Sequencer, Microprogram Memory 
and the Microinstruction Register form the microprogram control 
Portion of this /O Controlier. The Am2940 maintains the memory 
address and word count required for OMA operation. An internal 
three-state bus provides the communication path between the 
Microinstruction Register, the Am2917 Data Transceivers, the 
Am2940, the Am2901A Microprocessor, and the Device interface 


Circuitry. The Address Line Control, Data Transfer Contro! and 
Mode Control functions of this DMA Controller are incorporated 
into the /O Controller Microprogram and the Asynchronous 
Interface Control Circuitry. The VO Controller Microprogram also 
controls the Am2940. 


The Am2940 interconnections are shown in detail in Figure 13. 
Two Am2940s are cascaded to generate a sixteen-bit address. 
The Am2940 ADDRESS and DATA output current sink capability 
is 24mA over the commercial operating range. This allows the 
Am2940s to drive the System Address Bus and Internal Three- 
State Bus directly, thereby eliminating the need for separate bus 
drivers. Three bits in the Microinstruction Register provide the 
Am2940 Instruction Inputs, o-l2. The microprogram clock is used 
to clock the Am2940s and, when the ENABLE COUNTERS in- 
struction is applied, address and word counting is controlled by 
the CNT bit of the Microinstruction Register. 


Asynchronous interface control circuitry generates System Bus 
control signals and enables the Am2940 Address onto the Sys- 
tem Address Bus at the appropriate time. The open-collector 
DONE outputs are dot-anded and used as a test input to the 
Am2910 Microprogram Sequencer. 


The I/O controller read operation ts flowcharted in Figure 14. The 
CPU initializes the !/O controller by sending a read command, the 
starting memory address, the word count and any other parame- 
ters required to perform the operation. The UO Controller then 


. obtains a word of data from the VO device and requests use of the 


system bus for a DMA transfer. When the bus is granted, the I/O 
Controller requests a memory data transfer. Upon receipt of the 
memory acknowledge signal, which indicates the memory trans- 
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Figure 13, Am2940 Interconnections. ; / ’ 
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Figure 14. Read Control Flowchart. 


fer is complete, the l/O Controller tests the word count. if the word 
count is not equal to zero, the word counter is decremented, the 
address counter is incremented and another data word is trans- 


ferred. When the word count reaches zero, the /O Controller : 


terminates the data transfer and informs the CPU that the transfer 
has been completed. 


THE Am2942 PROGRAMMABLE TIMER/COUNTER, 
DMA ADDRESS GENERATOR. 


GENERAL DESCRIPTION: 


The Am2942, a 22-pin version of the Am2940, can be used as 
a high-speed DMA address Generator or Programmable 
Timer/Counter. It provides multiplexed Address and Data lines, 
for use with a common bus, and additional! Instruction Input and 
Instruction Enable pins. The Am2942 executes 16 instructions; 
eight are the same as the Am2940 instructions, and eight in- 
structions facilitate the use of the Am2942 as a Programmable 
Timer/Counter. The Instruction Enable input allows the sharing 
of the Am2S42 instruction field with other devices. 


When used as a Timer/Counter, the Am2942 provides two in- 
dependent, programmable, eight-bit, up-down counters in a 
22-pin package. The two on-chip counters can be cascaded to 
form a single chip, 16-bit counter. Also, any number of chips 
can be cascaded — for example three cascaded Am2942s form 
a@ 48-bit timer/counter. 

Reinitialization instructions provide the capability to reinitialize 
the counters from on-chip registers. Am2942 Programmable 
Control Modes, identical to those of the Am2940, offer four dif- 
ferent types of programmable control. 


Am2942 ARCHITECTURE 

As shown in the Block Diagram, the Am2942 consists of the 

following: 

e A three-bit Control Register. 

@ An eight-bit Address Counter with input multiplexer. 

@ An eight-bit Address Register. 

@ An eight-bit Word Counter with input multiplexer. 

@ An eight-bit Word Count Register. 

e@ Transfer complete circuitry. 

@ An eight-bit wide data multiplexer with three-state output 
buffers. 


_@ An instruction decoder. 
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Figure 15. Am2942 Block Diagram. - 


Control Register addresses. Under Instruction control, the Address Counter can 


Under instruction control, the Control! Register can be loaded or be enabled, disabled, and loaded from the DATA inputs, Dg-D7, 
tead from the bidirectional DATA lines, Do-D7. Control Register or the Address Register. When enabled ard the ACI input is 
bits 0 and 1 determine the Am2942 Contro! Mode, and bit 2 LOW. the Address Counter increments/decrements on the LOW 


determines whether the Address Counter increments or dec- to HIGH transition of the CLOCK input, CP. 


rements. Figure 16 defines the Contro! Regi f t 
m gui lefines tf egister forma Address Register 


Address Counter The eight-bit Address Register saves the initial address so that it 
The Address Counter, which provides the current memory ad- canbe restored later in order torepeata transfer operation. When 
dress, is aneight-bit, binary, up/down counter with fult look-ahead ~ the LOAD ADDRESS instruction is executed, the Address Reg- 
Carry generation. The Address Carry input (ACI) and Address ister and Address Counter are simultaneously Icaded trom the 
Carry Output (ACO) allow cascading to accommodate larger DATA inputs, Do-D7. 


HIGH when 
Word Counter = 0 


HIGH when 


Word Counter + 1 = Word Counter « 
Word Count Register Word Count Regisier 


HIGH when V/ord Counter = Address Counter 
Always LOW 


Figure 16. Control Register Format Definition. 


Word Counter And Word Count Register 


The Word Counter and Word Count Register, which maintain and 
save a word count, are similar in structure and operation to the 
Address Counter and Address Register, with the exception that 
the Word Counter incrernents in Control Modes 1 and 3 and 
decrements in Control Modes 0 and 2. The LOAD WORD 
COUNT instruction simultaneously loads the Word Counter and 
Word Count Rogister. 


Transfer Complete Circultry 


The Transfer Complete Circuitry is a combinational logic network 
which detects the completion of the data transfer operation in 
three Control Modes and generates the DONE output signal. The 
DONE signal is an open-collector output, which can be dot-anded 
between chips. 


Data Multiploxer 


The Data Multiplexer is an eight-bit wide, three-input multiplexer 
which allows the Address Counter, Word Counter and Control 
Register to be read at DATA lines Do-D7. The Data Multiplexer 
output, Yp-Y7, is enabled onto DATA lines Dg-D;, if, and only il, the 
Output Enable input, CEp, is LOW. (Refer to Figure 17.) 
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HIGH Z 
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Figure 17, Data Bus Output Enable Function. 


instruction Decoder 


The instruction Decoder generates required internal control sig- 
nals as a function of the INSTRUCTION inputs, Ip-l5 Control 
Register bits 0 and 1, and the INSTRUCTION ENABLE input, le. 


Clock 


The clock input. CP, is used to clock the Address Register, 
Address Counter, Word Count Register, Word Counter, and 
Control Register, all on the LOW to HIGH transition of the CP 
signal. 


Am2942 CONTROL MODES 


Contro} Mode 0 — Word Count Equals Zero Mode 


In this mode, the LOAD WORD COUNT instruction loads the 
word count into the Word Count Register and Word Counter. 
When the Word Counter is enabled and the Word Counter 
Carty-in, WCl, is LOW, the Word Counter decrements on the 
LOW to HIGH transition of the CLOCK input, CP. Figure 16 
specifies when the DONE signal is generated in this mode. 


Control Mode 1 — Word Count Compare Mode 


In this mode the LOAD WORD COUNT Instruction loads the word 
count into the Word Count Register and clears the Word Counter. 
When the Word Counter is enabled and the Word Counter 
Carry-in, WCI, is LOW, tne Word Counter increments on the LOW 
to HIGH transition of the clock input, CP. Figure 16 specifies 
when the DONE signal is generated. 


Control Mode 2 — Address Compare Mode 


in this mode, only an initial and final memory address need to be 
specified. The initial Memory Address {s loaded into the Address 
Register and Address Counter and the final memory address is 
loaded into the Word Count Register and Word Counter. The 
Word Counter serves as a holding register for the final memory 
address. When the Address Counter is enabled and the ACI input 
is LOW, the Address Counter increments or cecrements (de- 
pending on Control Register bit 2) on the LOW to HIGH transition 
of the CLOCK input, CP. The Transfer Complete Circuitry com 
pares the Address Counter with the Word Counter and generates 
the DONE signal during the last word transfer, i.e., when the 
Address Counter equals the Word Counter. 


Control Mode 3 — Word Counter Carry Out Mode 


For this mode of operation, the user can load the Word Count 
Register and Word Counter with the two's complement of the 
number of data words to be transferred. When the Word Counter 
is enabled and the WCi input is LOW, the Word Counter incre- 
ments on the LOW to HIGH transition of the CLOCK input, CP. A 
Word Counter Carry Out signal, WCO, indicates the last data 
word is being transferred. The DONE signal is not required in this 
mode and, therefore, is always LOW. 


Am2942 INSTRUCTIONS 


The Am2942 instruction set consists of sixteen instructions. 
Eight are DMA instructions and are the same as the Am2940 
instructions. The remaining eight instructions are designed to 
facilitate the use of the Am2942 as a Programmable Timer/ 
Counter. Figures 18 and 19 define the Am2942 Instructions. 


Instructions 0-7 are DMA instructions. The WRITE CONTROL 
REGISTER instruction writes DATA input Dp-Dz into the Control 
Register; DATA inputs D3-D, are “don't care” inputs for this 
instruction. The READ CONTROL REGISTER instruction gates 
the Control Register to Data Multiplexer outputs Yo-Y2. Outputs 
Y3-Y7 are HIGH during this instruction. 

The Word Counter can be read using the READ WORD 
COUNTER instruction, which gates the Word Counter to Data 
Multiplexer outputs, Yg-Y7. The LOAD WORD COUNT instruc- 
tion is Control Mode dependent. In Control Modes 0, 2 and 3, 
DATA inputs Dg-D, are written into both the Word Count Register 
and Word Counter. In Control Mode 1, DATA inputs Dp-D, are 
written into the Word Count Register and the Word Counter is 
cleared. 


The READ ADDRESS COUNTER instruction gates the Address 
Counter to Data Multiplexer outputs, Yo-Y7, and the LOAD AD- 
DRESS instruction writes DATA inputs Dg-D, into both the Ad- 
Gress Register and Address Counter. 


In Control Modes 0, 1, and 3, the ENABLE COUNTERS instruc- 
tion enables both the Address and Word Counters: in Control 
Mode 2, the Address Counter is enabled and the Word Counter 
holds its contents. When enabled and the carry input is active, the 
counters increment on the LOW to HIGH transition of the CLOCK 
input, CP. Thus, with this instruction applied, counting can be 
controlled by the carry inputs. 

The REINITIALIZE COUNTERS instruction also is Control Mode 
dependent. In Contro! Modes 0, 2, and 3, the contents of the 
Address Register and Word Count Register are transterred to the 
respective Address Counter and Word Counter: in Control Mode 
1, the content of the Address Register Is transferred to the Ad- 
dress Counter and the Word Counter is cleared. The 
REINITIALIZE COUNTERS instruction allows a data transfer 
operation to be repeated without teloading the address and word 
count from the DATA lines. 
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Notes: 1. Whon ly is ted LOW, the Am2942 acts as a OMA circuit: When |3 Is tied HIGH, the Am2942 acts 


as a Timer/Counter circuil 


2. Am2942 instructions 0 through 7 are the same as:Am2940 instructions. 


Figure 18. Am2942 instructions 


When ig is HIGH, Instruction inputs, I-la, are disabled. If 1 is 
LOW, the function performed !s identical to that of the 
ENABLE COUNTERS instruction. Thus, counting can be con- 
trolled by the carry inputs with the ENABLE COUNTERS instruc- 
tion applied or with Instruction Inputs Ip-lp disabled. 


Instructions 8-F facilitate the use of the Am2942 as a Pro- 
grammable Timer/Counter. They ditfer from instructions 0-7 in 
that they provide independent control of the Address Counter, 
Word Counter and Control Register. 


The WRITE CONTROL REGISTER, T/C instruction writes DATA 
input Dp-D2 into the Control Register. DATA inputs D3-D, are 
“don't care” inputs for this instruction. The Address and Word 
Counters are enabled, and the Control Register contents appear 
at the Data Multiplexer output. 


The REINITIALIZE ADDRESS COUNTER instruction allows the 
independent reinitialization of the Address Counter. The Word 
Counter is enabled and the contents of the Address Counter 
appear at the Data Multiplexer output 


The Word Counter can be read, using the READ WORD 
COUNTER, T/C instruction. Both counters are enabled when this 
instruction is executed. 

When the READ ADDRESS COUNTER, TIC instruction is exe- 
cuted, both counters are enabled and the address counter con- 
tents appear at the Data Muluplexer output. 

The REINITIALIZE ADDRESS and WORD COUNTERS instruc- 
tion provides the capability to reinitialize both counters at the 
same time. The Address Counter contents appear at the Data 
Muluplexer output. 


DATA inputs Do-D7 are loaded into both the Address Register 
and Counter when the LOAD ADDRESS, TIC instruction is exe- 
cuted. The Word Counter Is enabled and its contents appear at 
the Data Multiplexer output. 


The LOAD WORD COUNT, T/C instruction is identical to the 
LOAD WORD COUNT instruction with the exception that Ad- 
dress Counter is enabled, 


The Word Counter can be independently reinitialized using the 
REINITIALIZE WORD COUNTER instruction. The Address 
Counter is enabled and the Word Counter contents appear atthe - 
Data Multiplexer output. 


When the Te Input is HIGH, Instruction nips: 'p-lo, are disabled. 
The function performed when I is HIGH is identical to that per- 
formed when I is LOW, with the exception that the Word Counter 
coments appear at the Data Multiplexer output. 


EXAMPLE DESIGNS 


Figure 20 shows an Am2942 used as two independent, pro- 
grammable eight-bit timer/counters. In this example, an 
Am2910 Microprogram Sequencer provides an address to 
Am29775 512 x 8 Registered PROMs. The on-chip PROM out- 
put register is used as the Microinstruction Register. 


The Am2942 Instruction input, f; is ted HIGH to select the eight 
Timer/Counter instructions. The le. Iga, and OED inputs are 
provided by the microinstruction, and the Do-D, data lines are 
connected to a common Data Bus. GATE WC and GATE AC are 
separate enable controls for the respective Word Counter and 
Address Counter. The DONE, ACO and WCO output signals 
indicate that a pre-programmed time or count has been reachdd. 
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Figure 19. Am2942 Function Table. 
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Figure 20. Two 8-Bit Programmable Counters/Timers in a 22-Pin Package. 


Figure 21 shows an Am2942 used as a single 16-bit, pro- Figure 22 shows two Am2942s cascaded to form a 32-bit pro- 
grammable timer/counter. In this example, the Word Counter grammable timer/counter. The two Word Counters form the iow 
Carry-out, WCO, is connected to the Address Counter carry-in, order 16 bits, and the two Address Counters form the high 
ACI, to form a single 16-bit counter which is enabled by the order bits. This allows the timer/counter to be loaded and read 
GATE signal. 16 bits ata time. 
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* Figure 21. 16-Bit Programmable Counter/Timer Using a Single Am2942. 
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Figure 22. 32-Bit Programmable Counter/Timer Using Two Am2942s. 


In Figure 23, two-Am2942s are shown cascaded to form dual Figure 24 shows two Am2942s used as DMA address Generators 
16-bit counters/timers. GATE WC and GATE AC are separate on a common DATA/ADDRESS bus. The common bus allows the 
enable controls for the respective Word Counter and Address use of the Am2942 multiplexed data and address pins, Dg-D7. The 
Counter. Using the 16-bit Data Bus, each 16-bit counter can be Am2942 is in a 22 pin package whereas the Am2940, which has 
loaded or read in parallel. separate address and data pins, requires a 28 pin package. 
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Figure 23. Dual 16-Bit Programmable Counters/Timers. 
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Figure 24. Am2942s Used as DMA Address Generator on Common Bus. 


In this example the Am2942 Address Counter, Word Counter 
and Control Register are toaded and read directly from the 
CPU via the DATA/AOORESS bus. Since the bus carries ad- 
dresses as well as data, the Dg-D7 pins can be used also to 
enable the address onto the bus. 


Four bits in the Microinstruction Register provide the Am2942 
Instruction inputs, \g-lz and the Instruction Enable input Ie. The 
|, input is ted LOW, selectng the cight DMA instructions. The 


microprogram clock Is used to clock the Am2942s, and when 
the ENABLE COUNTERS instruction is applied or the instruc- 
tion is disabled (Ig = HIGH), address and word counting is 
controlled by the CNT bit of the Microinstruction Register. 


Interface contro! circuitry generates bus control signals and en- 
ables the Am2942 address onto the bus at the appropriate, The 
open-coliector DONE outputs are dot-anded and used as a test 
input to the microprogram sequencer. 
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Chapter VIII 
HEX-29 


INTRODUCTION 


Modern digital systems are becoming faster and increas’ igly 
complex. As a result, more is being demanded of digital désign 
engineers. Fortunately, there is a design technique that can 
greally simplify the design process. It can also lead to cleaner, 
more efficient, more reliable finished devices. This technique Is 
called MICROPROGRAMMING. Do not be confused by this 
word, it has nothing whalever to do with machine level language 
of programming a microprocessor. Microprog:amming is inher- 
ently more powerful than programming in a processor's instruc- 
tion set for many reasons, not the least of which is the access to 
the entire functional resources of the hardware on a machine 
cycle by machine cycle basis. An excellent treatment of micro- 
programming and microprogrammed machines is available from 
AMD in previous application notes. Perhaps the easiest to com- 
prehend introduction to this subject is in AMD's Microprogram- 
ming Handbook. This is highly recommended reading for any 
Newcomer to wus area of digital design. 


Though microprogamming has always been an inherently more 
powertu! design technique since its invention in 1955, it has been 
iittle used until recently (1976), and with some justification. The 
reason is quite simple. The very large majonty of IC's available 
until the 1976-1976 time frame were specifically designed to be 
used with ‘random logic’ design techniques. Since these random 
logic IC's were poorly suited to the highly structured nature of well 
designed microprogrammed systems, the potential advantages 
of microprogrammed systems could not be realized easily. 


Fortunately for all of us. in the mid 1970's AMD made a significant 
decision to develop a very extensive family of Schottky technol- 
ogy IC's specifically optimized tor use in microprogrammed sys- 
tems. These circuits belong to the Am2900 family as well as the 
Am25S, Am26S, Am27S, and Am25LS families. The acceptance 
has been so great that many of the other farge IC manufacturers 
are now second sourcing many of these parts and introducing 
others. So, in just three to lour years, microprogrammed machine 
design has come of age. Now, for most any job of medium to very 
high complexity, a microprogrammed system is the only way to go 
if a microprocessor isn't fast or versatile enough. 


The purpose of this application note Is to illustrate the use of 
microprogramming and ‘bit-slice’ technology in a high pertor- 
mance 16-bit time-sharing CPU. This application note is unique in 
that the CPU being described is the heart of a new commercially 
avatlable minicomputer system. Thus, it is possible to examine 
the nature of the CPU as it relates to a complete basic minicom- 
puter system. For this reason, a very short section follows that 
describes the basic system elements and the system goals to- 
ward which the CPU was designed. 


The product described herein is called the "HEX-29" CPU. Infor- 
Mation on the AMD devices embodied in this application note 
should be directed to AMD via your local AMD representatives. 
Inquiries about the HEX-29 CPU and minicomputer system for 
OEM and/or end user applications should be directed to: 


HEX-29 

Digital Microsystems, Inc. 
4448 Piedmont 

Oakland, CA 94661 

(415) 658-8532 


SYSTEM DESIGN GOALS 


In any significant project it is mandatory that reasonable, coher- 
ent system design goals bea spelled out belore sevious woik is 
begun. This can be a suronsingly short tis: of General specifica: 
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tions, but a well thought out system phitosophy can make ail the 
difference. Most important, everyone involved shouid have a 
copy $0 everyone will be pulling in the same direction. 


The following list represents the system design goals for the 
HEX-29 CPU and system. 


. Compact, reliable, easy to use. 

Multi-user, multi-task, timeshaning. 

. Fast, code-efficient high level language process’1g. 

. Low cost for complete system. 

. Intelligent microprogrammed channel controilers for high 
speed W/O. 


Wawons 


Indeed, this seems like a short list, but it ts the itst from which the 
more detailed specifications were developed. For example, in 
order to be compact, switching power supply lechnology is em- 
ployed. Rebability evolves (rc ~» many factors including bum in 
and testing cycles. Probably the si<.g!e ‘argest cause of flakiness” 
in digital systems is insutficien! > ing. An oversize fan moves 
about five times the volume of air past the IC's as ts normally 
recommended. This targa, slower speed fan has the additional 
advantage that the lower frequency ‘white noise’ it generales is 
far fess annoying than the ‘whine’ from smaller high speed fans. 


So. it is easy to see that many of the more specific details of 
system design will fall readily out of these overall design goals. 
The features of the final HEX-29 system are shown below. It 
should be instructive to trace each of these features to cne (or 
more) of the design goals listed above. Reviewing this list will also 
Prepare the context for the more detailed sections to follow in later 
sections. 


HEX-29 FEATURES 


VERY FAST 

-160ns basic machine cycle 

—-Only two machine cycles for many instructions 
~Microprogrammed clock for increased through-put 


COMPLETE SET OF DATA TYPES 
—Bit operations 

—-Nibble operations 

—-Byte operations 

~Word operations 

Double word operaiions 

~Quad word operations 

~ Variable field operations 


EXTENSIVE REGISTER SET 

16 general purpose’defined purpose registers 
-16 memory management registars 
-Extended function condition code registes 

—4 interrupt control/status registers 


MICROPROGRAMMED im , 
—Expandabie instruction set (on board) 

~Writable/fixed control store capabnity 

~ Integral itxed:fioating point processor 

Highly structured, comprehendible, modular des:qn 


SOPHISTICATED MEMORY MANAGEMENT 

-Multi-user and muiti-tas" timeshaiura avuctuie 

-Complete inte-task nrotectiun and secumy 

-Megabyte address'ng space (expandable) 

—Solftvcare protectat'e pages for snared re-entiant coding 

—Dual mode operating capability 

MULTIPLE STACK PRCCESSOR 

—Soaphistcalen program iinkuge inrough defined contrct such 
Bouter 
Miho Secciu register, daia stacn processing 
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SOPHISTICATED INTERRUPT STRUCTURE 

~8 level maskable vectored prioritized hardware interrupts 
~-Second level prioritized expansion on each hardware tevel 
~-256 levels of program controlled software interrupts 

— Invalid memory access trap is a vectored interrupt 
-Non-existent instruction trap is a vector interrupt 
-Breakpoint instruction is a special vectored interrupt 
—-Automatic mode switching on all interrupts 


HIGH THRU-PUT DMA/REFRESH CONTROL 

—8 level prioritized DMA requests and acknowledges 

— Up to four Mega-byte/second DMA transfer rate without slowing 
program execution 

~-Up to 12 Mega-byte’second DMA transfer rate 

~ Integra! transparent dynamic memory refresh control 


EXTENSIVE HIGH LEVEL INSTRUCTION SET 

~Multitude of data types handled 

-Enormous variety of addressing modes : 

—-General register and defined register classes of instructions 

—Many very fasi numeric and string macroinstructions 

-Integral 16 and 32-bit integer and 64-bit floating point ADD, 
SUB, MUL, Div, CMP, NEG, etc. 

- Advanced character, byte and word string processing 

~Microcoded high level language primitives 


VERY HIGH QUALITY PHYSICAL DESIGN 

~Four layer P.C. cards throughout system (internal GND and 
Vec) 

~All bus signals interleaved with direct retum ground path 

-All bus signals active low: three-state to inactive level 


INTELLIGENT CHANNEL CONTROLLERS 

-Microprogrammed floppy disk and hard disk controllers 

—Services multiple users ’O siultaneously and transparent to 
CPU program execution 

~Reduces executive program complexity and speeds execution 


SOFTWARE SUPPORT 


—-Mult-user’multi-task time-sharing operating system includes , 


sophisticated file management features 
—- Sophisticated resident macro-assembler 
-Customized micro-assembler 
-Superfast, super extended BASIC interpreter 
-True PASCAL compiler (not interpreter) 
- Advanced editor and word processor package 
-More sofware coming 


‘ 


lt should be clear from this list that the HEX-29 minicomputer is a 
powerful'sophisticated design. This is DIRECTLY attributable to 
the availability of the excellent Schotiky technology 1.C.’s availa- 
dle from AMD for use in microprogrammed digital systems. 


in a well designed microprogrammed system there should be 
VERY few random logic gate packages required. In the HEX-29 
CPU, there are only a few gales used as such. If anywhere near 
20‘ of a microprogrammed system is composed of gate pack- 
ages, it is probable that the design can be further simplilied to 
replace the random logic with microcode and/or structured logic 
techniques. It is important to note that the more functions that are 
implemented wilh structured logic and controlled by microcode 
bits. the more versatile and general is the whole design. 


MICROPROGRAMMED MACHINES 


It is highly recommended that AMD's MICROPROGRAMMING 
HANDBOOK be studied before this application note if a detailed 
understanding of the HEX-29 CPU Is desired. The idea Is, of 
course, that the basic principles of microprogrammed machines 
be familiar before this specilic example is examined. The 
Am2900 Learning and Evaluation Kit is also recommended as a 
practical introduction tool. For those only interested in the capa- 
bilities of a well designed microprogrammed CPU, that reading is 
not entirely necessary, and Section V of this Application note will 
be superfluous. Section IV is a more general discussion for these 
readers, but Is also necessary for those going on to Section V. 


A short discussion of microprogrammed systems appears here 
only as a short refrosher for those who have studied the MICRO- 
PROGRAMMING HANDBOOK by John Mick and aim Bnck of 
AMD. 


Any microprogrammed machine can be divided into the following 
two discrete parts: 


1. Control store and microprogram controt 
2. Data routing and function logic 


These two sections of a microprogrammed machine are really 
quite nearly independent. In effect, the control store and micro- 
program control section Is the ‘boss and brains’ of the operation. It 
issues all of the orders and makes all the decisions. The data 
routing and function logic devices are merely puppets that carry 
out the commands selected by the microprogram control logic 
from the control store. Note that ‘microword memory’ and ‘mi- 
crocode' are used interchangeably with ‘control store’ and are 
synonomous. 


Control Store and Microprogram Control 


The control store is simply a number of PROM's. The number of 
locations in this memory is chosen to be large enough to hold the 
desired number of microprogram routines. The width of the word 
is chosen to have sufficient bits to control all of the possible 
functions in the data routing and function logic. Admittedly, RAM 
or EPROM could be used as the memory devices, butit is best to 
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Figure 1... 


think of it as an array of read only memory devices. So, schemali- 
cally an example of a control store array looks like Figure 1. 


In practice, there is a register between the microword data bits 
and the actual data routing and function control devices. This 
register assures that all bits change simultaneously at the begin- 
ning of each new microinstruction cycle and allows the execution 
of one microinstruction with the fetching of the next. The addition 
of this ‘pipeline register’ is shown in the Figure 2 expansion of our 
block schematic. 


The remaining part of this section is the ilemoragram control 
unit, more commonly called the microprogram sequencer. Tha 
microprogram sequencer Is nothing more than a presettable bi- 


nary counter with a few extra functions. Figure 3 shows this 
device in place. 


We show the sequencer as a 12-bit binary counter with a few 
other inputs. The outputs (Y) drive the address lines of the control 
store PROMs. So, each time the system clock rises, the counters 
increment and sequential addresses are accessed from the 
PROM. Note that the current output of the control store 1s cap- 
tured in the pipeline register on this same LOW-to-HIGH transi- 
tion. Thus, the sequencer-Is always fetching the NEXT contro! 
store word which will control the fetching of the next, and so on 
and so on. 


Note that there are several bits from the pipeline register that are 
touted back to the sequencer. in our example, 12 bits are used as 
a microword branch address and another bit is used as a preset 
enable (load) line. Normally, each cycle of the system clock 
increments the sequencer outpuls and the next microword is 


fetched from the control store. However, somewhere down the - 
line we are going to want to branch to a microcode sequence that -— 
1S not ‘In line with the code that 1s currently executing. It is very ~ - 


easy to see how this is done. 
The microaddress of the routine to which we want to branch is 


imbedded in the current microword, 12 bits in our example. The - 


microword bit that is connected to the load input of the sequencer 
is coded to be low on this cycle. So, the sequencer, which is really 
just a 12-bit counter with a unique load control in our example, 
will cause the branch address we selected to pass through to the 
Output of the counters and fetch the microword from the micro- 
address to which we branched. The routine will now continue to 
execute sequentially addressed microwords until we execute 
another branch code. = > anes a 


The only other really necessary function we need from our se- 
quencer is the ability to do conditional branches. In other words, 
we want to be able to branch to some microcode routine. but only 
if a Certain condition exists. As usual, this capability is easily 
added; only one multipiexer is needed. Figure 4 shows the new 
configuration. 


MICROADORESS —34— 


“PIPELINE BITS’ CONTROL DATA ROUTING. 
ANO FUNCTION LOGIC DEVICES 


SEQUENCER 
{BINARY COUNTER) 


toad 


Now two additional microword bits contro! the conditions under 
which a microbranch will take place. If input 0 is selected, a 
branch will always take place since the logic LOW level on input 0 
will appear at the load input of the sequencer. Conversely, if input 
3 is selected, a HIGH logic level is always routed through the 
muitiplexer to the load input and a toad is not performed. Thus the 
Next sequential microinstruction is fetched. So far we can do 
branch and continue functions with the multiplexer. 


If we select inputs 1-or 2 on the condition select multiplexer, we 
may get one of two conditions. If the selected input is HIGH, it will 
be routed to the load input of the sequencer and no load will take 
place. But if the selected condition is at a LOW logic level, the 


- load input of the sequencer is pulled LOW, a toad is performed, 


anda branch has been accomplished. Since a branch only occurs 
when the condition bit is LOW, this function is called a ‘branch on 
condition = 0°. Cleary a ‘branch on condition = 1° can be im- 
plemented simply by inverting the condition bit before it enters the 
multiplexer. : 

So as far as controiling the flow of microgrograms goes, itis clear 
that we can make it look very much like assembly language 
programming of a microcomputer. We can execute sequential 


. microinstructions (in tine code), branch conditionally, or branch 


unconditionally. If we use real live sequencers like the Am2909, 
Am2910 or Am2911 instead of binary counters we get several 
other very important functions including micro-subroutining and 
tooping. 
When we substitute Am2909's, Am2910's or Am2911's as our 
sequencers. the final element of our complete micraprogram 
memory and control section is in place. Figure 5 shows this 
configuration. 


The next address PROM of Figure 5 converts the microcode 
branch function bits into one of two sets of bits that control the 
function performed by the Am291 1's. Which of (he two 1s chosen 
depends upon the logic level of the particular condition bit that is 
selected. 


This is the basic structure of any microprogram contol unit re- 
gardless of what the rest of the system looks like. The width ol the 
microword data word, the microaddress field, the condition select 
field, etc., will change as needed, but the structure remains the 
same. Note that some of the microword data bils are used to 
contro! the microprogram sequencing logic. The bits left over are 
used to control the data routing and function logic in the device: 
iLe., everything else! 


Data Routing and Function Logic 


The data routing and function logic section of a micropro- 
grammed machine closely reflects the job the device 1s to per- 
form. In this respect there is some similarity with random logic 
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designs. The key difference is the glue that binds all of the small 
functional units that make a device work. In a random logic design 
it «Ss a more or less random array of gates and flip-flops that 
interconnect and control these functional units. 


The chief advantage of a microprogrammed machine is that this 
random logic is largely replaced by the coherent sequences of 
contro! bits that is the microprogram. Problems such as race 
conditions, undesirable interactions between functional elements 
and marginal timing nearly disappear in a microprogrammed 
O-cisn, Otten there are one or two internat data buses on which all 
trans'ers of interna! data between functional units take place. 


Think of several possible sources of information that may be 
needed in a particular design. if they are all three-statable de- 
vices, microword bits could be tied to the output enable of each 
and the desired device enabled onto the internal bus on a micro-* 
cycle by microcycle basis. Likewise one or more devices may 
capture this data. Microword bits attached to the clock pulse (CP) 
inputs of registers and the like can achieve this function. 


Further, microword bits select other functions to be performed, for 
example an ALL or shift function. Much of Section V of this 
application note will demonstrate the use of these data fovting 
and funclion logic control bits. 


GENERAL SPECIFICATIONS 


The following section of this application note explores the design 
of the HEX-29 CPU on an intermediate level. It vill be similar in 
detail to the detailed hardware and software spr .ifications given 
lor most microprocessors by the manufacturers. In other words, 
all the information needed to use the HEX-29 CPU, including bus 
timing and instruction set, are examined. This will serve to dem- 
onstrate what can be achieved in a medium level micropro- 
grammed machine. It will also serve as a necessary transition for 
those planning to study the more detailed internal structure of the 
CPU in the next section of this application note. 


It is very important. when designing a microprogrammed ma- 
chine, that the target device be specified in detail approaching 
that given in this section. Only then can an intelligent attempt at 
hardware design begin. It is especially important to define a 
clean, simple, reliable interface between the microprogrammed 
device and other system elements. Considerable attention 
should also be paid to defining data types, instruction formats, 
interrupt requirements, etc. 


internal CPU Registers 


The HEX-29 CPU has 36 internal registers. Of these, 16 are 
memory management (map) registers, 16 are general purpose 
registers, three are associated with the interrupt structure, and 
one is the condition code register. 


Table 1 shows the functions associated with the.16 general pur- 
pose registers of the HEX-29. It is most significant that all 16 
general purpose registers have alternate functions. This should 
not imply that they are not true general purpose registers how- 
ever. Any register can be used as an accumulator, stack pointer, 
index register, memory puinter, data counter, etc., in most in- 
Structions. To increase coding efficiency and execution speed, 
however, some instructions use the defined register assignments 
in Table 1. 


TABLE 1. 


Alternate 
Function — 


Program Counter 
Stack Pointer 
Data Passing 
Y Index Register 
X index Register 
Accumulator 
Counter 
Scratchpad 
FP1 (LSW) 
FP1 
FP1 
FPt 
FPO 
FPO 
' FPO 
FPO 


(MSW) 
(EXP) 
(LSW) 


(MSW) 
(EXP) 


Noles: FP1 = Floating Point Aegister 1. 
FPO = Floating Poift Register 0. 
DW1 = Double Word Register 1. 
DwWO = Double Word Register 0. 


. 


For example, the instruction set of the HEX-29 CPU can load 
immediate, push, pop, and move indexed and direct any of the 
multiple register combinations (FP1, FPO, DW1, DWO) in one 
instruction. One mode of indexed addressing and many byte 
Processing instructions benefit greatly from the alternate use of 
some registers. . 


Condition Code Register 


The condition code regrster contains all zeros in its upper byte. 
The bit assignments in the low byte are shown in Table 2. 


TABLE 2. CONDITION CODE REGISTER BITS. 


Function 


User Flag #2 
User Flag #1 
User Flag #0 


Half Sign Flag (Bit 7; MSb of low byte) 
Zero Flag 

Negative Flag (MSb of result) 

2's Complement overtiow flag 

Carry Flag (arithmetic and shift carry) -_ 


The user flags (U2, U1, U0) are an extra feature of the HEX-29 
CPU. They are not altered by any but five special flag modification 
instructions (SETF, CLRF, COMF, POPF, LDF). These op codes 
set, clear, complement, pop, or load the flags respectively. Since 
the user flags are immune to change except by these special 
purpose flag altering instuctions, they are excellent for passing 
Status information between routines. 


The half sign flag (H) is set if the result of an operation contains a 1 
in the most significant bit of the low byte; otherwise itis cleared. 
This flag is useful in many byte processing and loop counting 
routines. 


Ifthe result of an operation is zero, the zero flag (Z) is set, or else it 
is cleared. This is the most useful of all the tlags and is used on 
_ comparisons, arithmetic and logical operations, loop counting, 
etc.... 
When the most significant bit of the result of an operatiqnis a logic 
1. the negative flag (N) is set. Otherwise it is cleared. Note thatin 
two's complement notation, the most significant bit of a number 
determines the sign of the number. if it is a logic 1, the number 1s 
negative: if it is a logic 0, the number is positive. 


if the two's complement result of an anthmetic operation results in 
a two's complement overtiow, the V flag is Set. This flag ts also 
used as a general error flag by the HEX-29 CPU. For example. the 
V flag is set if a divide by zero instruction is attempted. In floating 
point notation, if the exponent becomes too large or small. (anth- 
metic overflow; underflow), the V flag 1s set to so indicate. 


"The carry flag (C) Is used for two purposes. It is a source and or 
destination bit in shift and rotate instructions, and as a carry-out 
bit when an arithmetic function result is too targe to fit in the 
appropriate destination register. The convention with regard to 
the carry flag on addition and subtraction follows. 


C flag = 1 if 1. Binary add results in a carry out. 
2. Binary subtract results in no borrow. 
C flag = Oif 1. Binary add results in no carry out. 
_ 2. Binary subtract results in a borrow. 
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Alt of the condition code flags, except the user flags, have some | 


special meanings in some of the comptex ‘macro’ instructions. 
These are described in the detailed section on the HEX-29 in- 
struction set. 


interrupt Registers 


There are three special purpose interrupt registers in the HEX-29 
CPU. They are. 


1, Mask Register 
2. Status Register 
3. Vector Register 


These registers are command driven, thatis, the register selected 
1S a funchon of the interrupt command being executed. More 
detatied information on the nature of these registers appears later 
in this application note. 


Memory Management Registers 


A sophisticated memory management structure is embodied in 
the HEX-29 CPU. Integral to this structure is the set of 16 memory 
map registers. These 8-bit registers contain transformation val- 
ues that allow multiple users and tasks to share the processing 
time of the CPU without interacting with each other. Each task 
togged onto the HEX-29 is unique from all others through its 
memory map image When it is chosen to run on the CPU. its 
memory map image becomes synonomous with the CPU mem- 
Ory map registers. More detailed information on this aspect of the 
HEX-29 CPU appears later in this application note. 


Instruction Formats 


The instruction formats of the HEX-29 CPU are simple and few in 
number For this reason, the HEX-29 instruction setis not difficult 
lo learn and use, even though it 1s very extensive and quite 
sophisticated. 


Emphasison the use olacbi (hexadecimal and 8-bit (byte) fields 
in the instruction formats simplity the organization of the instruc- 
on set. All of the instruction formats used in the HEX-29 are 
shown in Figure 6. 
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OP CODE 
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Rs - Source Register (operand, pointer, index reg., slack 
pointer, etc ) 

Ra = Destination Regster (operand, pointer. stach pointer, etc.) 

H = 4-oit (hex) quanuty 

Byte - 68-bit byte idata, index, offset, address. etc.) MPR-86 


Figure 6. instruction Formats. 


Most instructions involve operations on 16-bit words. However, 
the HE X-29 instruction set also includes op-codes that operate on 
the following data types 


1°16. (Bit) 

2. 4 Bits {Hex or Nibble) 

3. 8Bits (Byte) 

4. 16 Bits (Word) 

5 32 Bits (Doub!e Word) 

6 64 Bits (Quad Word Floating Point) 
7. N Bits (Vanable Format) 


In addition to working on the fixed fengih data types, there are 
many “macro instructions that operate on variable length 
character, byte, and word strings in memory. These strings can 
be either contiguous in memory or in the form of Inked lists. 
Several of these ‘macro’ instructions are highly optimized micro- 
coding of the most critical routines used in high level language 
processing. 


The multipheity of data types processed efficiently by the HEX-29 
increases its ability to meet the diverse semancs of modern 
computing. 


Addressing Modes and Assembly Language 


Much of the power and simplicity of the HE X-29 instruction set is 
denved trom the large number of useful addressing modes avail- 
able for the most used basic functions such as MOV, ADD, SUB, 
INC. DEC, CMP, etc. Addressing nodes specify where operands 
of an instruction are to be found and where the resull is to be 
stored. 


The 16 general purpose 16-bit registers are designated RO, R1, 
R2...AC, AD, RE, RF. These are the primary names of the 16 
registers and refer directly to the corresponding registers. Inother 
words, when ‘RD’ is written in a HEX-29 ASsembly Language 


HAL) program, the contents of this regisier are used as an 


operand or destination in the instruction. 


The use of a-register as a pointer to memory ts called memory 
pointer addressing. The names MO, M1, M2,... MC. MD. ME, 
MF apply to the 16 general purpose fenton when thoy are used 
as memory pointers. 


When a register points to a memory location which contains the 
address of the memory location holding the value of interest, the 
register is said to be an “indirect pointer”. The names (0. 11, 
IC, 1D” 1€, iF are used to specify the 16 general purpose 
tegisters when they are being used with this type of addressing. 


indexed addressing is possible using the names ZO, Z1,2Z2,.... 
ZC, ZD, ZE. The use of one of these names means that the datas 
at the address formed by adding the contents of the register 
referenced to the contents of word following the instruction in 
main memory. 


Most often, when a register is used as a memory pointer (MD for 
example), or as an indirect pointer (19 for example), itis extremely 
desirable that the register auto-increment or perhaps auto- 
decrement since programs, lists, and stacks are ordered in a 
positive directibn through memory. 


In HEX-29 Assembly Language (HAL) itis quite simpie to specily 
that a memory, indirect pointer register, etc. is autorincremented 
or auto-decremented by appending a ‘+° of "—" character to the 
tespeciive register specification. 


For example: 


MOV M7+, RE The contents of memory pointed to by R7 is 
moved into AG. R7 is then incremented. 


MOV RA, ME - Decrement RE. Then move the contents of RA 
into the memory location pointed to by RE. 


Wis significant that auto-incrementing takes place after the oper: 
ation while auto-decrementing takes place before the operation; 
(auto-post-increment and auto-pre-decrement.) 


. 


Several very fundamental addressing modes arlse from auto- 
incrementing memory and indirect pointers. Consider the fotlow- 
ing examples: 


A. Program Counter (RF) as an auto-incrementing pointer yields 
‘immediate addressing’. 
MOV MF+, RA = Move immediate into RA. 
ADO MF+,R6 = = Add immodiate to R6. 
MOV MF+, RF = Jump to address in immediate word. 


8. Stack Pointer (RE) as an auto-incrementing pointer yields 


‘stack addressing’. 


MOV ME+, R2_ = Pop top of stack into R2. 
XOR ME+, R1 = = Pop top of stack and XOR into Al. 
MOV ME+, RF = Retuin from subroutine! 


C. General registers used as data stack pointers. 
ADD MD+, MD = = Add top two members of data stack + 
leave result on top of the stack. 

CMP MD+, M8+ = Compare top members of two stacks + 
remove these values from the stacks. 
= AND immediate word with the top mem- 

ber of stack pointed to by R6. 


D. Program Counter (RF) as an auto-incrementing Indirect 
pointer yields ‘direct addressing’. 


AND MF +, M6 


MOV IF+, R7 = Move direct into R7, 
ADD 1iF+, RC = Add direct into AC. 
MOV IF+, IF+ ° = Move direct to direct. 


R should be clear that these examples represent only a few of the 
most useful of many possible uses of auto-incrementing and 
auto-decrementing with memory and Indirect pointers. Careful 
Study of the HEX-29 instruction set will reveal many more uses 
Not examined in these examples. 


Classes of instructions 


The instruction set of the HEX-29 includes many different func- 
tions and a multitude of addressing modes. Nonetheless, all 
instructions fall into ono of two classes of instructions. The gen- 
eral register class of instructions are extremely flexible because 
of the enormous number of variations inherent in each op-code. 
The defined register class of instructions permits extremely fast 
and memory efficient code for often used functions and register 
sets. The power of the HEX-29 instruction set is derived from an 
extensive combination of the most powerful and efficient instruc- 
tions from each class. 


General Register Instructions 


In a general register instruction, the function and addressing 
mode are specified in the op-code field (upper byte). The lower 
byte then holds two 4-bit (hex) values that specify the registers 
used in the instruction. It should be clear, therefore, that for every 
general register instruction there are 256 possible specific ac- 
tions that can be performed. 


The full power of these instructions may not be evident without an 
example. A discussion of just 5 of the 256 possible variations on 
the MOV M+, R instruction will demonstrate the extreme flexi- 
bility of each and every general register instruction. Execution of 
the MOV M+, R instruction proceeds as follows: 

1. Contents of Rs are moved to the address bus. 

2. Rs is auto-incremented by one. 

3. The data addressed by Rs is loaded into Rd. 
in this instruction, As is used as an auto-incrementing mem- 
ory pointer, hence the M+ notation. As and Rd are the source 
an destination registers. Below is the set of 5 examples of 
how the one op-code can be used to implement a number of 
Important functions. 


1. MOV MF+, R3 (W) = Load immediate word (W) into R3. 
2. MOV MF+, RF (W) = Jump direct to address (W). 

3. MOV ME+, R6 = Pop top of control stack into R6. 
4. MOV MD+, R4 = Load next member of list into R4. 
5. MOV ME+, RF = Return from subroutine. 


Taking a few minutes to review this section and understand how 
all of these functions are achieved with the single MOV M+, R 
op-code should reveal the nature of the power and flexibility of the 
general register class of instructions. 


Defined Register Instructions 


A defined register Instruction is an instruction whose function, 
addressing mode, and register assignments are ail defined in the 
op-code field (upper byte). The low byte is then available for use 
as an offset for short relative branching instructions, an im- 
mediate byte or character, an 8-bit index value, an B-bit Icgical 
mask, etc. With this class of instructions, often only one word is 


- required for the entire insuuction. This speeds execution and 


improves coding efficiency markedly in most applications. It is for 
this class of instruction that the altemate register function as- 
signments appear in the model of the HEX-29 register set. An 
example of a one word defined register instruction with the two 
word instruction it can replace follows: 


ADC X,A 26 Defined Register Instruction 
ADC ZB, RA 0026 General Register instruction 


Both of the instructions accomplish the same thing. In both cases 
RB (X index register) is used as an index register and RA (Ac- 
cumulator) is the destination operand. The value in memory at the 
address pointed to by the sum of the X index register (RB) plus the 
hex constant 26 is added to the contents of the Accumulator (RA) 
and the sum left in Accumulator (RA). 


The significant difference between the two instructions is that the 
defined register instruction takes only half as much code (one 
word vs. two), and executes faster since there are fewer memory 
accesses and fewer machine cycies. Very otten the defined reg- 
ister instruction will be adequate for the job. But when the 
choices of registers RB and RA are not acceptable or if an 8-bit 
index offset is not large enough, the general register instruction 
would be the proper choice. It allows any register pair to be 
specified as the index register and destination/accumulator and 
has a 16-bit index offset in the word following the instruction. 


As mentioned earlier, it is largely the ability of mixing defined and 
general purpose instructions freely that makes programs written 
in HEX-29 Assembly Language very code efficient and fast. 


HEX-29 Instruction Set 


The HEX-29 Instruction set is quite extensive. It not only of- 
fers all of the basic functions in a wide variety of addressing 
modes, it also includes a multitude of special purpose instruc- 
tions. These special purpose instructions cover many impor- 
tant aspects of programming including program control, 
numeric processing, string manipulation and searching, list 
processing, etc. 


Fortunately, all of these types of instructions fall into one of 
only fcur different instruction formats. These were shown in 
Figure 6.Table 3 shows all of the instructions for the HEX-29 
machine. 


265 


266 


TABLE 3. SUMMARY OF MNEMONICS ARITHMETIC OPERATIONS. 


AOC 
“AOD 
ADDB 
AODH 
OADD 
FADD 


SBB 

SuB 

SUBB 
SU8H 
RSUB 
OSUB 
FSUB 


UMUL 
SMUL 
OMUL 
FMUL 


UDIV | 
SpIv 
DDIV 
FOIV 


CMP 
CMPB 
CMPBA 
CMPH 
CMPHN 
CMPHA 


Add words plus carry 
“Add words wio carry 
Add byte to word 
Add hex value (nibble) to word 
Add double word values (32 bits + 32 bits -» 32 bits) 
Add floating point values (64-bit FP + 64-bit FP — 64-bit FP) 


Subtract-with borrow 

Subtract w/o borrow 

Subtract byte frem word 

Subtract hex value irom word 

Subtract words in reverse order 

Subtract double word values (32 bits — 32 bits — 32 bits) 
Subtract floating point values (64-bit FP — 64-bit FP > 64-bit FP) 


Unsigned word multiply (16 bits * 16 bits -» 32 bits) 
Signed word multiply (16 bits * 16 bits —> 32 bits) 

Double word signed multiply (32 bils * 32 bits -» 64 bits) 
Floating point multiply (64-bit FP ° 64-bit FP —> 64-bit FP) 


Unsigned word divide (16 bits + 16 bits — 16 bits + 16-bit remainde:) 
Signed word divide (16 bits + 16 bits + 16 bits + 16-bit remaindes) 
Double word signed divide (32 bits + 32 bits > 32-bit + 32-bii remainder) 
Floating point divide (64-bit FP + 64-bit FP -> 64-bit FP) 


‘Compare words 

Compare byte with word 

Compare byte with byte , 
Compare positive hex value (nibble) with a word 
Compare negative hex value (nibble) with a word 
Compare hex value (nibble) with another nibble 
Compare signed double word values 


Compare floating point values 


Negate word (2's complement) 

Negate signed double word value 

Normalize floating point number 

Test signed double word value for zero + sign 
Test floating point value for zero + sign 


Increment word by one 
Decrement word by one 


Rotates 


Arithmetic shift nght 

Arithmetic shift left 

Count and shit left (until MSb=1) 
Double word shift left 

Dopble word shift right 

Logical shift right 

Rotate closed lett 

Rotate left (through carry flag) 
Rotate right {through carry flag) 
Vanable shift left (0 to 15 places) 
Variable shift right (0 to 15 places) 


Mt a i er Re 
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TABLE 3. SUMMARY OF MNEMONICS ARITHMETIC OPERATIONS. (Cont.) 


Boolean AND words 

Boolean AND byte with word 
Boolean inclusive OR words 
Boolean inclusive OR byte with word 
Boolean exclusive OR words 
Boolean exclusive OR byte with word 
Complement word ° 
Clear the specified 2 registers 

Bit set 

Bu clear 

Bit invert 

Bit test 

Clear specified fiags 

Set spectiiad flags 

Complement specified flags 


Data Movement 


MVN~ Move, no flags altered 

Move, update flags 

Move multiple words 

Move a byte 

Load a byte 

Store a byte 

Move a positive nibble 

Move a negative nibble 

toad immediate 2 registers 

Exchange contents of two registers 

Exchange contents of DW1 and DWO 

Exchange contents of FP1 and FPO 

Exchange top two members of any stack 

Duplicate top member of any stack 

‘Switch’. Store register indexed and reload indexed 
Move .any bit field from one word to another - 

Swap high and low bytes in a word 

Push any two registers onto control stack 

Pop top two words on control stack into two registers 
Push flags (condition code register) onto control stack- 
Pop top of control stack into condition code register 
Push 8 registers onto control stack 

Pop & registers from control stack 

Push R8, R9, RA, RB, RC, RD onto control stack 
Pop RS, RY, RA, RBRC, RD from control stack 
Load interrupt register 

Read interrupt register 

Read a memory map location 

Load a memory map location 

Fill memory map 

Block move bytes forward in memory 

Block move bytes reverse in memory 

Block move words forward In memory 

Block move words reverse in memory 


PROGRAM CONTROL 


Execute contents of register as an instruction 
Retum from interrupt 

Breakpoint trap 

Jump if specified flags are set 

Jump if spscified flags are clear 

Catt subroutine if specified flags are set 
Cail subroutine if specified flags are clear 
Jump indirect if specitied flags are set 
Jump indirect if specified flags are clear 
Call subroutine if specified flags are set 
Call subroutine if specified flags are ciear 
Retum from subroutine if specified flags are set 
Retum from subtoutine if specified flags are clear 
Jump to the address specified 

Call subroutine 

Call executive (soltware interrupt) 
Branch it greater than 

Branch if greater than or equal 

Branch if less than 

Branch if iass than or equal 

Transition table word branch 

Transition table byte branch 

Decrement and Branch Non-Zero 

Branch on zero or decrement 

Compare and branch if in bounds 
Branch 

Branch to subroutine 

Branch if carry flag set 

Branch if carry flag not set 

Branch if overflow flag set 

Branch if overflow flag not set 

Branch if negative flag set 

Branch if negative flag not set 

Branch if zero flag set 


_ Branch it zero flag not set 


Branch if half sign flag set 
Branch if halt sign flag nol set 


Miscellaneous Instructions 


NOP 
*SCNB 
*SCNW 


” *SEAF 


“SEAL 


No operation for 2 to 256 cycles 

Scan for match with specified byte 

Scan for match with specified word 

Basic fixed entry length list search 

Basic variable entry length linked list search 


“These ‘macro’ instructions are examined in more detail on the folowing pages. 
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SMUL 


SUMMARY OF SELECTED 'MACRO’ INSTRUCTIONS 


Unsigned 16-bit multiply 

16 bits 16 bits —- 32-bit answer 

R3 R2--R3 (MSW of answer) 
— R2 (LSW of answer) 
— Ri (LSW of answer) 
— RO (MSW of answer) 


If V=1 then RO is not zero (Answer is longer than 16 bits) 
11 N=1 then MSb of RO = 1. (No particular significance) 
It Z=1 then answer is zero. (Ri and RO are cleared) 


Signed 16-bit multiply (Two's complement notation) 
16 bits 16 bits —> 32-bit answer 
A3 R2-—-R3 (MSW ol answer) 
— R2 (LSW of answer) 
—~-R1  (LSW of answer) 
—» RO (MSW of answer) 
If V1 then answer is longer than 16 bits (overflowed LSW) 
{t N=1 then answer is negative 
if Z=1 then answer is zero (R1 and RO are cleared) 


Unsigned 16-bit divide 

16 bits / 16 bits —» 16-bit answer and 16-bit remainder 

R3/ R2 — R2 R3 holds remainder 

lf V=1 then an attempt to divide by zero was refused 

i N=1 then MSB of answer = 1 

If 2=1 then answer is zero. (A2 = 0 R3 need not be zero) 


Signed 16-bit divide (Two's complement notation) 

16 bits / 16 bits —» 16-bit answer and 16-bit remainder 

R3/ R2 -» R2 R3 holds remainder 

It V=1 then an attempt to divide by zero was refused, or overflow 
If N=1 then the answer is negative 

If Z=1 then the answer is zero (R2 = 0 R3 need not be zero 
R3 has sign of numerator 


Double word signed add 

32 bits + 32 bits —» 32 bits 

DW1 + DWO -- DWO fe. - -R3,R2 + R1,R0 — R1,R0 
The C flag is treated the same as in Rogie word addition 

If V=1 then a two's complement overflow occurred 

If N=1 then the answer is negative 

H Z=1 then the answer is zero 


Double word signed subtract (Two's complement notation) 

32 bits — 32 bits —- 32 bits 

DW1 - DWO-- OWO ie. R3,R2 — R1,RO — R1,R0 

The C flag is treated the same as in sing!e word subtract 

IF V=1 then a two’s complement overflow occurred 

If N=1 then the answer is negative 

If Z=1 then the answer is zero 

Hf one divides ‘'8000" by ‘‘FFFF” (—32768 + —1) the answer is ‘‘8000"" (+32768). However, 8000 is a negative 
number in two's complement, so an overflow has occurred 


Double word signed multiply 
32 bits = 32 bits + 64 bits 
Ow1 OWO0 — OW0,0W1 je. R3,R2  R1,RO > R1,R0,R3,R 
NOTE: The order of the answer words is as follows: 
MSW — R2 
MSW-1-—+ R3 
MSW -2-—» RO 
MSW-3— Ri (LSW) 


SUMMARY OF SELECTED ‘MACRO’ INSTRUCTIONS (Cont) 


The reason for this seemingly unnecessary odd oder concerns the results that are desired in OWO (RO,R1) at the end 
of the operation. The desired result of 32-bil math operations are nearly atways 32-bit answers. However, a 32- 

bit ¢ 32-bit multiply can generate up to 64 bits. Therefore, the teast significant 32 bits of the answer are stored in 
OWO where the answer is expected on ail double word (DW) instructions. The most significant 32 bits must be stored 
in OW1, therefore the seemingly reversed order of storage. If the V flag = 0 at the completion of an operation, then 
only the 32 bits in DWO are significant and the user program can store this 32-bit double word without fear of losing 
significant bits. So. in the normal situation where only the least significant 32 bits of the answer is desired and the 
more significant 32 bits of the answer does not contain significant bits, the answer 1s where the normal convention 
Specifies: in DWO. If the V flag is found set and it is desirable to save the 64-bit result rather than go to an error 
routine. a simple OXCH will exchange the contents of OW1 and DWO and leave the 64-bit answer in a logical order 
with the MSW in AO and the LSW in R3. It can then be buffered with any of the floaung point register O butter 
instructions. If V=4 then the answer has greater than 32 bits of significance. 

tf N=1 then the answer is negative 

If Z=1 then the answer is zero 


ODIV Double word signed divide (Two's complement notation) . 
32 dils / 32 bits —» 32-bit answer and 32-bit remainder 

DW1 / OW0 -- DWO Remainder —» DW1 

if V=1 then attempted divide by zeso was refused, or overflow 

It N=1 then answer is negative : 

if Z=1 then answer Is zero - (DWO = 0. - OW1 nol tested) 


OCMP Double word compare (Two's complement notation) 
32 bits — 32 Dils — Nowhere (Update V.N.Z flags) 
Ovit - OWO — Nowhere 
The C flaa is treated the same as in a single word compare 
~ {t V=1 then a two's complement overflow occurred 
it N=1 then the difference is a negative value 
if Z=1 then the difference is zero 


OXCH Double word exchange 
Operates on any contents of OW1 and D'W/0 
DW1-—- TEMP) OW -» OW1 TEMP —» DWO 
DW1 = R3 and R2 
DWO = R1 and RO 
No flags are altered 


ONEG Double word negate (Two's complement notation) 
0000 0000 — 32 bits — 32 bits 
0000 0000 - DWo — DWO0 
lf V=1 then a two's complement overflow occurred OWO = 8 GO00 0000 
If N=1 then the fina! value in DVO is negative 
If Z=1 then the final value in DWO is zero 


TST DWO Ooub!e word test value (Two's comp!ement notation) 
Set flags based upon the contents of DW0 
0000 0000 + DWO -» Nowhere = (Update V.N.Z) 
if V=1 then a valid 2's complement value overflows the LSW 
If N=1 then the value in DWO is negative 
If Z2=1 then the value in DWO is zero 


FPADO Floating point add Double Precision (64 bits) 
. Standard HEX-29 floating point format 
FP1 + FPO — FPO 
If V=1 then an overflow in the 2's complement exponent occurred 
If N= 1 then the answer is negative 
If Z=1 then the answer is zero 


FPSUB Floating point subtract Double Precision (64 bits) 

Standard HEX-29 floating point format - 

FP1 — FPO —» FPO ’ 

If V=1 then an overflow in the 2’s complement exponent occurred 
if N=1 then the answer is negative 
- If Z=1 then the answer is zero 
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FPMUL 


FPDIV 


FPCMP 


FPNRM 


FPXCH 


TST FPO 


SEAL 


‘ 


SUMMARY OF SELECTED ‘MACRO’ INSTRUCTIONS (Cont.) 


Floating point multiply Double Precision (64 bits) 

Standard HEX-29 floating point format 

FP1 FPO — FPO 

lf V=1 then an overflow in the 2's complement exponent occurred 
If N=1 then the answer is negative 

lf Z=1 then the answer is zero 


Floating point divide Double Precision (64 bits) _ 

Standard HEX-29 floating point format 

FPt / FPO -» FPO : 

Mf V=1 then an overflow in the 2's complement exponent occurred or negative zero refused. 
{1 N=1 then the answer is negative 

If Z=1 then the answer is zero, 


Floating point compare § Double Precision (64 bits) 

Standard HEX-29 floating point format 

Compare the magnitudes of FP1 and FPO 

IfN XOR V = 1, then FP1 < FPO 

I Z=1 then FP1 =-FPO ‘ 


NOTE: WE HAVE TO FURTHER DEFINE THE WAY THIS WORKS, BUT THIS INSTRUCTION WiLL SET THE FLAGS 
SUCH THAT THE 2's COMPLEMENT BRANCH ON THE EF PAGE WILL WORK!!! 


Floating point normalize Double Precision (64 bits) 

Standard HEX-29 floating point format 

The sign of the mantissa must be in the MSb of the exponent word before this instruction is executed 

Shift mantissa lett and increment exponent until MSb of the MSW of the mantissa is one. (Operates on FPO only) 

{f V=1 there was a 2's complement overflow of the exponent 
ne C flag is trashed 

N=1 result is negative 

Z=1 result is zero 


Floating point exchange Double Precision (64 bits) 
Operates on any contents of FPi & FPO (R7 thru RO) 
FP1 —- TEMP FPO — FP1 TEMP —> FPO 

FP1 = R7, R6, RS, R4 

FPO = R3, R2, Ri, RO 

No flags are altered 


Floating point test Double Precision (64 bit) 
Standard HEX-29 floating point format 

Set the flags based upon the contents of FPO 

Mf N=1 then the value in FPO is negative 

If Z=1 then the value in FPO is zero 


BASIC stnng variable / numeric or string matrix element search 
The SEAL instruction provides a very flexible way to rapidly and efficiently search linked lists for a particular entry. _ 
in each entry in the list, the first two 16-bit words are ordered as follows: The first word of each entry is the link 


Offset to the next entry in the linked list. The second word is the entry name word. Any 16-bit value can be used in 
this field. 


The name of the entry to be searched for must be put in the accumulator (RA) before this instruction is executed. The 
format of the instruction is as follows: 


SEAL F,Md where F (s the literal binary value 1111. 


The destination field of the instruction’ (Md) specifies the register that must point to the beginning of the linked jist. 
Starting at this point, this instruction will link its way thru the list looking for a match between the word after the link 
offset word (the entry name) and the contents of the accumulator (RA). 


At the completion of the instruction, the Z flag indicates the results of the instruction in the following manner: 


Z=1 No match was found in list (End of list reached) 
z=0 A match was found and Md is pointing to the word after the entry name that matched the accumulator 
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SUMMARY OF SELECTED ‘MACRO’ INSTRUCTIONS (Cont.) 


Since the link offset word is a two’s complement value, it can fink to any other location in memory. The link offset is 
equal to the difference between the address of the next link offset word and the address of the current link offset 
word, minus one. 


Note that this instruction can be used to search linked lists with entry names that are much fonger than 16 bits with 
ease. For example, if the entry names to be matched are 2 words long, all that need be done is to compare the word 
at which the pointer is aimed with the second word of the desired variable name. If it matches, then the pointer now 
points to the first element in the list after the double word entry name. If it does not match, the search can be 
continued by backing up the pointer to the link offset of the current entry and re-executing the SEAL instruction. 


Al the completion of the instruction, the contents of the register specified by the Md fieid in the instruction will 
contain the address of the word AFTER the variable name in the fist entry that matched the one in the accumulator 
(RA). At the completion of the instruction the Z flag will indicate the results of the instruction execution. If the Z flag 
is al a zero level, the search was successful and the pointer to the table (Md) contains the appropriate value. On the 
other hand, if the Z flag is set to a one level, no match to the variable name in the accumulator was found anywhere 
in the linked list. 


LO VN da da...da da LO VN da... da da LO VN... 


LO = Link Offset word 
VN = Vanable Name word 
da = data entries irrelevant to instruction 


Basic fixed link offset variable search 


The SEAF instruction provides a very flexible way to rapidly and efficiently search lists for a particular entry. It is 
slightly different from the SEAF instruction in that the link offset word is not imbedded in the list entries. instead, this 
instruction assumes that all list entries are of the same length (even though the intemal formats may vary). The value 
of the link offset is the immediate word following the BASF op code word. 

Perhaps thé most obvious use of this instruction is for searching a numeric variable list for a specific variable name 
followed by the value. The lists entries can be any length, so single and double word integers and lloating point lists 
can all be handled with equal ease, but not all with the same instruction since the list entries will not be the same 
length for afl of these. 

The link offset word following the instruction is a two's complement number. Therefore, any fixed length can be 
searched forwards or backwards in memory. The link offset constant equals the number of words in each list entry, 
or its 2's complement for a backwards search. 

Again the variable name word to be searched for must be put into the accumulator (RA) before the BASF instruction 
is executed. And the contents of the destination field register (specified by Md) points to the first element of the list. 
The form of this instruction is shown below: 


SEAF 0,Md where 0 = binary 0000 
Scan for word . 
The SCNW instruction Is of the following form: 


SCNW Ms,Md 


This instruction scans a table of words (pointed to by Rs) for 2 match with the contents of the accumulator. Each 
time a word is fetched from the table, Rd is incremented. If Ad contains zero at the beginning of the instruction, then 
it will contain the number of the words searched in the source table before a match with the accumulator occurred. 


Altematively Rd may contain a pointer to another table. When a match between the accumulator and the source table 
occurs, Rd will point to a corresponding entry in the ‘destination’ table. 


If the source list pointer and the destination list pointer are the same, then the two tables are interleaved; ie. the 
combined list would start: 


Source list word #1 
Destination list word #1 
Source list word #2 
Destination list word #2 
Source list word #3 
etc. 
etc. 
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SUMMARY OF SELECTEO ‘MACRO’ INSTRUCTIONS (Cont.) 


This instruction can be very useful in command processing routines and for searching lists that are not finked within 
the list itself (see BASS and BASF). 


The ijast entry in the source list must be a zero. If no matches were found previous to this zero word, then the Z flag 
is set. it the Z flag was not set, then a match was found and the pointers are valid. This instruction is interruptable 
on a word by word basis. 2 


Scan tor byte 


The scan for byte instruction (SCNB) works identically to the scan for word instruction except that the source list 
contains bytes packed into words. Thus the source list is only halt as long as the destination list (if there 1s one). 


Note that both lists must start on word boundaries. Only the low byte of the accumulator Is used in the compare with 
the source bytes. The contents of the accumulator are not affected by the instruction. This instruction is interruptable 
on every other byle that is compared. The Z flag has the same meaning as for the SCNW instruction. 


Instruction Matrix 


A convenient way to present all of the basic op-codes of the 
HEX-29 CPU is by way of an ‘instruction matrix’. The eight-bit 


Op-code in the upper byte is broken into two nibbles. The most- 


Significant nibble of the op-code appears on the lett side of the 
matrix shown in Figure 7. The lower nibble appears along the top 
tow. The second matrix shown in Figure 8, is called the ‘extended 
function’ matrix. In the HEX-29 CPU, the low byte of the insiruc- 
on word is interpreted as an ‘extended function’ op-code if the 
upper byte is an ‘EF’ hex. 


Memory Management 


The HEX-29 incorporates a sophisticated memory management 
structure. Though very clean and elegant in implementation, the 
* Capabilities of the processor are greatly extended by this circuitry. 
Transparent to the user not requiring its many features, this 
Structure is vital to many very important applications; most sig- 
nificanuy the support of multi-user, multi-task, time-sharing 
operations. 


To all programs executing on the HEX-29, all memory addresses 
are 16 bits long. But before these 16 lines reach the system bus, 
they pass through the memory management section of the 
HEX-29 CPU. In this circuitry, the most significant four bits 
(A15-A12) are ‘mapped’ into eight bits on the bus (a ‘write-protect’ 
bit (WP) and seven address lines (A18-A12)). The net increase of 
three address bits expands the total addressable memory space 
to 512k words or 1 Megabyte. The WP bit is used to write protect 
the memory in blocks as desired by the executive program. 


Since each of the 16 locations In the memory map represents a 4k 
word block (or page), up to 64k words can be addressed by a 


program at any time. Any location in the memory may contain any 
&-bit value, so memory that is contiguous to a program need not 


be contiguous in physical memory. For clarity, Figure 9 shows 


schematically how this ‘memory mapping’ works. 

The tow 4k words of physical address space is reserved for the 
nucleus of an operating system; also called an executive or 
supervisor program. This is called physical page zero. The con- 
tents of the memory map can only be altered if the low location of 
the memory map contains all zeros. Since this is synonymous 
with the physical page zero address block, only the executive 
program is able to change the contents of the memory map. And 
since all VO devices and channel control blocks are located in 
physical page zero, all l/O must also be done through the execu- 
tive program. Likewise, all hardware and software interrupts in- 
voke the supervisor automatically. 


Because of this simple but fool-proof security scheme, complete 
protection of ail users memory space and VO devices can easily 
be maintained by the executive program. 


Also note that the supervisor program can safely make programs 
that are re-entrant available to several users simultaneously as 
long as it write protects the code. Since user programs are often 
No larger than the host program under which it is running, this 
technique can result in a savings of 30% to 50% in system 
memory usage. 


Occasionally, for special purposes, a single user may wish sole 
access to the entire resources of the system. Examples would 
include programs too large to run in a single user's 128k bytes of 
memory. Or perhaps a new W/O access method. In any case, itis 
possible for a single user on the system to gain complete control 


Figure 7. HEX-29 instructions. 
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Figure 9, Memory Mapping Program Address (Logical Address). 


and access to the system by assigning himsel! as the executive 
program. This can only be accomplished alter a system reset, 
Hence only those with physical access to the computer (and who 
have a reset key) can accomplish this operation. This user is then 
empowered with all of the features and capabilities of the 
machine with no limitations. Direct access to all of the system I/O 
devices, the enlire interrupt structure, the memory map, etc., is 
then at the command of the single user in the executive or 
supervisor mode. 


Most often, each user needs only one or two 4k pages of memory 
in addition to the host program which is probably shared. Thus it 
would be very wasteful if each user were to have access to a full 
65k words of physical memory space. For this reason, a page of 
physical memory has a special designation in the system. 


The highest possible physical address block when write pro- 
tected is called the ‘invalid access’ block. Whenever a user ac- 
cesses memory that the supervisor has mapped into the invalid 
access block, the processor ‘iraps’ to a special location in the 
Supervisor program called the ‘invalid access trap’. This occurs 


before the current machine cycle is completed. This is treated 
identically to an interrupt by the processor except that the current 
instruction is not completed. 


Any number of actions can be taken by the supervisor at this time. 
This will usually depend upon the resources of the machine and 
the circumstances under which the problem arose. For example, 
the executive program could inform the program that its memory 
space had been exceeded, or perhaps just allocate another block 
of memory to that user's memory map and continue the execution 
of the ofiending program. A more detailed discussion of the 
sequence of events that takes place upon an invalid access 
appears in the section on the interrupt structure of the HEX-29 
CPU. 


The highest physical address page, when not write protected, Is 
called the ‘dead page’. No action-of any kind takes place in this 
block and there is no memory there for the program to reference. 
Any number of pages from any number of users may be assigned 
to this physical page without fear of interaction. This is the block 
that will normally be assigned by the executive program to all user 
areas that are not needed or are not to be used. 


Interrupt Structure 


The HEX-29 CPU contains a powertul interrupt structure. As with 
memory management, this aspect of the CPU operation is largely 
transparent to users of the system. In most applications the HEX 
OPERATING SYSTEM FOR TIMESHARING (HOST) program 
services all interrupts. Nonetheless, it is useful to know the basic 
Structure of the interrupt system. The three types of interrupts 
serviced by the HEX-29 CPU are examined in the following 
paragraphs. 


The hardware interrupts are caused by signals from physical 

devices outside of the processor. These signals, generated by 

peripherals, their controllers, or the realtime clock, serve to notify 

Hi CPU of some condition or requirement of the interrupting 
vice. 


The HEX-29 CPU has eight hardware interrupts. They are indi- 
vidually maskable and are prioritized into eight levels. Each prior- 
ity level has its own vector associated with it. In other words, each 
interrupt level has a corresponding memory location through 
which program control is passed upon that levet interrupt. These 
memory locations are witnin the defined executive page (physical 
page 0) and thus all interrupts cause the HEX-29 to switch into 
executive mode automatically. The eight hardware interrupt 
levels and the associated memory locations are shown below. 


Hardware Interrupt Level! 
Highest Priority 7 


Memory Location of Vector 


5 
4 
3 
2 
1 
{?) 


Lowest Priority 


So, for example, when an interrupt occurs on level 3, the HEX-29 
CPU will enter supervisor mode, save the users PC and SP, and 
call the appropriate service routine at the address stored in mem- 
ory location 0403,,. 


Normally, each hardware interrupt level is reserved for a class of 
devices such as hard disc controllers, floppy dis¢ controllers, 
serial channels, etc. If, for example, there are eight serial devices 
that are interrupting on level 0, the service routine is required to 
locate the one (or more) devices that are requesting service on 
thatinterrupt level and processes them accordingly. This could be 
done by polling all the serial devices whenever the interrupt was 
received. A more efficient technique, used in the HEX-29 system, 
is to further prioritize the like devices on a given interrupt level. 
Then when an interrupt occurs, a vector is read by the execulive 
program that instantly informs it of the highest priority device 
requesting service on that level. When that device is serviced, the 
vector is read again to locate any other devices in need of service 
{if any), and finally resumes normal program execution when all 
devices are serviced. 


A software interruptis an Instruction that, when executed, causes 
an interrupt to occur. The mnemonic used for this op-code in the 
HEX-29 CPU is ‘CEX', which stands for ‘call executive’. This 
instruction passes an 8-bit vector to the ‘HOST operating system 
which is used to determine the action requested by the program 
executing the CEX. Except that this interrupt is caused by a 
program rather than a physical device, the CEX operates in the 
Same manner as a hardware interrupt. It vectors through memory 


location 040C. A pseudo software interrupt is the breakpoint 
‘BPT instruction which vectors through memory location 0408. 
The BPT instruction does not pass an 68-bit vector to the executive 
and is thus useful in program debugging. 


The third type of interrupt is called a ‘trap’. A trap takes place 
when certain conditions occur that require the processor's im- 
mediate attention. For example, if the program cyrrently running 
on the CPU tries to execute an op-code for which there is no 
defined instruction, an ‘invalid instruction trap’ occurs. This is 
essentially a service to notly a user that his program was defec- 
tive and that an attempt was made to execute an op-code which 
has no meaning. These locations are left blank in the instruction 
matrix since they can subsequently be defined as new instruc- 
tions. This ‘trap’ vectors through memory address 040D and acts 
identically to all other interrupts. The only oiher trap in the HEX-29 
CPUis the ‘invalid memory access’ condition. This is discussed in 
more delail in the previous section on memory management. The 
‘invalid memory access’ trap vectors through memory address 
0408. 


Table 4 shows the memory locations that are defined in the 
HEX-29 for interrupt handling. 


TABLE 4. INTERRUPT MEMORY LOCATIONS. 


System Defined Uses 


Reserved 

Reserved 

Vector for invalid instruction trap 
Vector for call executive (CEX) instuction 
Vector for breakpoint (BPT) instructon 
Temperature storage tor user stack pointer 
Temperature storage for executive stack 
pointer 

Vector for invalid memory access trap 
Vector for hardware interrupt leve! 7 
Vector for hardware Jnterrupt level 6 
Vector for hardware interrupt level 5 
Vector for hardware interrupt level 4 
Vector for hardware interrput level 3 
Vector for hardware interrupt level 2 
Vector for hardware interrupt level 1 
Vector for hardware interrupt level 0 


Memory 
Location 


Again, note that all interrupts are processed identically so that the 
one return from interrupt (RT!) instruction properly terminates all 
interrupt service routines. 


DMA/REFRESH CONTROL 


In order that an efficient multi-user or multi-task system be im- 
plemented, it is necessary that the processor not be burdened 
with the relatively slow transfer of programs and data between 
system memory and mass storage devices such as floppy and 
hard disks. For this reason, the controllers for these devices are 
designed wilh a high degree of intelligence and self-reliance. 
These controllers take virtually all of ihe burden of mass storage 
transfers upon themselves. This frees the HEX-29 CPU to exe- 
cute programs for all users nol waiting for these mass storage 
translers to take place. Because these controllers are essentially 
separate special purpose microprogrammed CPUs, they are 
often called ‘peripheral processors’, ‘channel processors . or just 
‘channels’. 
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For this scheme to be effective, both the CPU and the channe! 
processors must be accessing system memory concurrently. 
Fortunately, the inherent structure and operation of the HEX-29 
CPU is eminently suited to this requirement. 


In every instruction there is at least one machine cycle during 
which the HEX-29 CPU is decoding or intemally executing an 
instruction. During these machine cycles the CPU does not use 
the system bus; the system bus and memory are available for 
access by devices other than the HEX-29 CPU. This is called a 
‘Free DMA cycle’ or ‘bus available’ cycle. During these machine 
cycles a channel processor may read or write memory without 
interfering with, or assistance trom the HEX-29 CPU. The act of 
accessing system memory by any device other than the CPU is 
called ‘direct memory access’ or DMA since the channe! pro- 
cessor is directly accessing system memory without CPU assis- 
tance or intervention. 


Resident In the HEX-29 CPU is a very clean, very powerful 
multilevel prioritized OMA structure. Within this structure up to 
ten groups of devices can share the system bus on a priority 
basis. Normally the priority levels are assigned on the basis of 
transfer speeds . . . the faster the device is able to support mem- 
ory transfers, the higher the priority itis assigned. In this manner 
several channel processors can access system memory concur- 
rently at the intervals they require. The DMA structure of the 
HEX-29 CPU can support very high combines transfer rates with 
multiple DMA devices using this technique. With high speed 
memory, the HEX-29 CPU need not even slow down its program 
execution to support a concurrent combined DMA transfer rate of 
4 Megabytes per second. With slower memory, this figure drops 
to about 2 to 3 Megabytes per second. Even this slower rate 
corresponds to concurrent DMA by one high speed hard disk plus 
several floppy disks plus room to spare. Still, the CPU can be 
halted, if necessary, to achieve combined DMA rates of up to 12 
Megabytes per second maximum. 


The support of dynamic memory in the HEX-29 system is simpii- 
fied by signals associated with this DMA structure. Whenever 
there are no devices requesting the bus for DMA, a signal on the 
bus indicates this condition. Dynamic memory refresh controllers 
can take advantage of these unused free DMA cycles to refresh 
intemal dynamic RAM chips if desired. Even when very heavy 
use of the bus by DMA devices occurs, itis unlikely that too few of 
these unused free DMA cycies will be available for the dynamic 
memory refresh controllers. In this event, however, another signal 
can be used to disable all other DMA priorities and allow the 
refresh controllers as much time as is required. 


SYSTEM BUS AND TIMING 


When specifying the bus signals and their timing relationships 
dunng the earty design stage of the HEX-29 CPU, utmost atten- 
tion was paid to simplicity and reliability. The result is that there 
are very few signals required to interface to the bus properly, and 
the timing requirements are quite straight forward and easy to 
meet. 


The fotlowing section is a descnption of the mnemonic names and 
functions of the HEX-29 system bus signals: 


System Bus 


A18-AO 
(Address Bus) 


Three-state outputs. A18-A0 are the 19 physical 
address lines of the HEX-29 system address 
bus. A18 is the most significant bit, AO is the 
least significant bit. These outputs are three- 
stated whenever the bus Is available (BA is low). 


D15-D0 
(Data Bus) 


WP (also WE) 
(Write Protect) 


RW 
(Read/Write) 


VMA 
(Valid Memory 
Access) 


CLK 


Three-state and bi-directional input/outputs. 
D15-DO are the 16 lines that make-up the 
HEX-29 system data bus. D15 Is the most sig- 
nificant bit, DO is the least significant bit. 


Three-state output. WP is used to protect 
areas of memory from being written. Practically 
speaking this signal is active-LOW and would 
have been called WE (Wnte Enable) if not for 
possible confusion with the read/wrile signa! 
which also must be LOW to write merfory. 


Three-state output. The R/W signa! determines 


- whether aread or write operation is performed. A 


LOW level of the R/W line indicates a write 
memory is to be performed if VMA (valid memory 
access) is also LOW when the system clock 
(CLK) goes LOW. ’ , 


Threo-state output. VMA is LOW during al! 
cycles that a memory access (read or write) 
will be performed by the processor. 


Output, not three-state. CLK Is the system 
clock. All timing in the HEX-29 system is defined 
relative to this signal. For convenience, the 
period of each machine cycle that the clock is 
high is called ¢, (phase 1) and the period that it 
Is low is called ¢2 (phase 2). All system ‘chip 


selects’ are derived from this signal. 


BA 


(Bus Available) cycles durin 


STR 
(Stretch Clock) 


cLR 
(Clear) 


17-16 
(Interrupts) 


Output, not three-state. SOMA is mnemonic 
for ‘synchronize direct memory access’. This 
bus signal is LOW the cycle before DMA is per- 
missible. The sole purpose of this signal is to 
notify OMA devices early of an upcoming ‘free 
DMA’ cycle. This will make it easier to ‘grab the 
bus’ very early in a ‘free DMA’ cycle to improve 
the address generation timing. 


Output, not three-state. BA is LOW on all 
which DMA Is permitted by the 
CPU. When BA is LOW, all three-stateable out- 
puts from the HEX-29 CPU card are tuned off 
and control is relinquished to DMA devices for 
the current cycle. BA is mnemonic for ‘bus avail- 
able’. 


Input to HEX-29 CPU. When an addressed de- 
vice is not fast enough to be reliably accessed 
(read or written) within the minimum access time 
of the HEX-29 CPU, it should pull the STR signal 
LOW. For each 40ns that STA is held LOW, the 
system clock is lengthened by 40ns and thus the 
access time required of the addressed device. 
This signal can be held LOW for as many as 
40ns increments as required to meet the access 
time of the addressed memory or I/O device. 


Output, not three-state. CLA is a LOW level 
pulse which is just a ‘cleaned up" RESET signal. 
Any device that requires an initialization pulse 
should use this line. 


Inputs to HEX-29 CPU. 17-16 are the eight 
hardware interrupt inputs. 17 is the highest prior- 
ity and 0 is the lowest. These Inputs are nega- 
tlve edge catching; that is, an interrupt signal is 
recognized by the interrupt circultry in the 
HEX-29 CPU when the line goes LOW. These 


A7-F6 
(DMA 
Requests) 


07-50 
(DMA 
Acknowledge) 


NRO 
(No‘DMA 
Request) 


ODMA 
(Disable OMA) 


FETCH 
(Fetch 
Instruction) 


osc 
_ (Oscillator) 


lines should be driven by open collector outputs 
so that multiple devices can interrupt on the 
same priority level. 


Inputs to HEX-29 CPU. A7-RO are the eight 
DMA request inputs. F7 is the highest prior- 
ity, RO is the lowest. These lines are active- 
LOW; i.e.. a LOW level requests DMA time. 


Outputs, not three-state. 77-G0 are the eight 
DMA acknowledge lines that reply to the cor- 
resporiding OMA request lines (A7-Ad). A reply 
to the highest requesting priority is acknowl- 
edged by a LOW level on the corresponding 
acknowledge tine. Only one of these lines will 
be LOW at any given time: i.e.. the highest 


*priority request gets the acknowledge. 


Output, not three-state. NRO is LOW when no 
DMA requests (R7-A0) are being received. This 
is used pnmarily as a signal to dynamic memory 
refresh controllers that a refresh may be per- 
formed on any ‘free DMA’ cycle. 


Input to HEX-29 CPU. When DDMA is pulled 
LOW, no DMA requests are acknowledged. Es- 
sentially this line is just the highest priority DMA 
request line - except there is no corresponding 
acknowledge signal. This signal is normally re- 
served for dynamic memory refresh controllers. 
If the refresh interval is about to expire and some 
locations have not yet been refreshed, this line 
can be pulled LOW to disable all other DMA 
devices and assure adequate time fo refresh the 
remaining locations. Note that NRQ is not LOW 
when DDMA is active (LOW). The DDMA line 
should be driven by open collector outputs. 


Input to HEX-29 CPU. When pulled LOW, 
the HALT input will cause the processor to ter- 
Mminate program execution at the conclusion of 
the current instruction. At this time the bus will 
become continuously available for DMA as all 
three-state outputs of the HEX-29 CPU will tum 
off and BA will go active (LOW). This line can be 
held LOW indefinitely. When released, the pro- 
cessor will continue program execution. This line 
should be driven by open collector outputs. 


Output, not three-state. This signal is LOW only 
On memory read cycles when an instruction Is 
being fetched from system memory. Otherwise 
this signal is normally not used except during 
system development and debugging for single 
instruction execution. 


Input to HEX-29 CPU. This is the signal 
from which system reset (CLA) is derived. Nor- 
mally this input is simply grounded with a 
pushbutton or keyswitch to reset the HEX-29 
system. ode 


Output, not three-state. This is the crystal con- 
trolled master oscillator trom which the system 
clock is derived. The period of this oscillator is 
normaily 40ns. (25MHz). 


System Timing 


In any microprogrammed system which must interface to a 
number of external devices (as a CPU must), it is critical that 
considerable forethought be given to the methods of inter-device 
communication. It is quite common to design and build devices 
that operate with very high degrees of reliability — only to find that 
overall system reliability is inadequate when the yarious devices 
are interfaced. re 


One of the utmost goals in designing the HEX-29 CPU was to 
develop an extremely reliable, easy to use, system bus definition. 
Simplicity and reliability go hand in hand and this is reflected in the 
HEX-29 system bus. Perhaps the single most important decision 
in this regard was to define that all memory and W/O device 
accesses by the processor or DMA devices would share one set 
of timing rules. In other words, one set of timing specifications 
applies to any kind of access of any device by any other device. 
Some systems have different timing requirements for all sorts of 
reasons: a few examples are listed here. 


1. Memory read timing is more cntical (shorter) if the memory 
being fetched is an instruction. 

2. Variations exist in the set-up and hold times required on 
read memory vs. write memory cycles. 

3. Memory devices and I/O devices use some different signals 

r and timing specifications. 

4. DMA devices are required to meet a different set of timing 
requirements than the processor. 

5. Interrupt processing routines violate the normal memory 
access techniques, 


Special cases carry special problems and should be avoided like 
the plague. It is always best and easiest to have all devices and 
situations share one set of control signals and one set of tming 
relationships. Another good practice put into effect on the HEX-29 
CPU is the exclusive use of active-LOW bus signals. This is 
important in many respects. First, bipolar logic C's can sink (pull 
LOW) far more current than they can source. Thus any noise 
spikes need to carry far more energy to force the signal into an 
invalid level. Secondly, all signals that three-state (turn-off) will be 
pulled-up (float) to the inactive level. Furthermore, this scheme 
tends to reduce the power required by bus signal drivers and 
therefore reduce heat dissipation. 


Physical design is also important to system reliability, It is wise to 
use four layer PC cards with GND and Vec¢ planes as the intemal 
layers, as do all of the HEX-29 system cards. An additional 
feature of the HEX-29 system bus is that all signals are interlaced 
with GND traces that return directly to the intemal GND plane next 
to each bus signal. System termination should also be provided 
whenever signals must travel more than 18". Bypass capacitors 
should abound on all system cards, one per three IC’s as a 
minimum. The HEX-29 averages one per IC. 


The timing of each machine cycle in a HEX-29 system is a 
combination of synchronous and asynchronous charactenstics. 
Actually, all signals are synchronous with ~ or are synchronized 
by — the master oscillator from which the system clock is derived. 
Thus, despite the fact that some signals seem to be asynchron- 
ous, they are actually synchronized automatically with the system 
clock. The simplicity of this approach will become clear once the 
relationship af all signals to the system clock is explained. 


The conventions regarding the HEX-29 system clock are very 
simple. All machine cycles begin when the system clock goes 
HIGH and end simultaneously with the begining of the next 
machine cycle. The period of time that the system clock (CLK) is 
HIGH is called }, (phase 1) and the period of time that it is LOW 
is called }2 (phase 2). See Figure 10.for clarification. 
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Figure 10. 


During all memory and VO accesses, the processor (or DMA 
controller) must guarantee that all address lines and control sig- 
nals are vabd for at least 20ns before the end of 4, (falling edge 
ot clock). Depending upon the addressing mode, the processor 
wil require a vaniable penod of time to generate a valid address. 
-Thus it is the responsibity of the processor to contro! the period 
of 4, to meet its requirements. If no external accesses are made 
by the CPU, 6, and d> will last onty 8Ons each unless a DMA 
device takes control of the bus on that cycle and requires longer 
umes. 


Similarly, 42 is controlled by the memory and I/O devices on the 
bus. If none are being accessed on a particular machine cycle, 
no control need be exercised on the system clock and ¢2 will last 
for 8Ons. However, when accessed, many memory and V/O de- 
vices more than 80ns to perform a successtul read or write 
operation. They must be able to lengthen 42 of the system clock 
to increase the access time appropriately. This is accomplished 
with the STR bus signal. When a device is accessed that re- 
quires that ¢2 be longer than 80ns, it must bring STR LOW 
within SOns of the falling edgeof system clock (i.e., 50ns into 
2). For every 40ns that STR is held LOW, the system clock is 
heid in its present state for an additional 40ns. 42 can thus be 
extended indefinitely as required by the access time of the ad- 
dressed device. d, can also be extended in 40ns increments 
with the STR signal if so required by DMA devices with slow 
address generation times, or the like. 


A DMA device must gain access to the bus before it can access 
the memory location that it desires. This is very simple. It simply 
pulls its DMA request line LOW and waits for the corresponding 
DMA acknowledge signal to go LOW in reply. Then, at the be- 
inning of the first machine cycle which finds these signals plus 
ny LOW. the DMA device has been granted access to the 
bus and may immediately generate the appropriate signals on 
the address, data, and control buses to accomplish the transfer. 
The memory device being accessed does not care whether it is 
the processor or a DMA device on the bus since the bus signals 
and timing used by the memory card is identical for both. Thus it 
controls 42 with the STA signal as necessary and the access is 
completed in exactly the same manner as if it had been the 
Processor controlling the bus. The Boolean equation for a DMA 
device gaining access to the bus follows — and Figure 11 is a 
schematic showing how easy the implementation can be. 


Gx « Rx - SDMA - CLK = DMA device has access for the 
current cycle 
X = any DMA priority level 
The timing relationships for the HEX-29 bus are shown in 
Figure 12. 


PULSE THIS 
LINE TO 
osy REQUEST OMA 


Cancer necvest 


ENABLES OMA 
ADORESS BUFFERS 
ANO CONTROL 
SIGNALS ONTO 
BUS WHEN LOW 


Figure 11. DMA Bus Signals. 


INTERNAL OPERATION 
Block Diagram 


The block diagram of the HEX-29 CPU (Figure 13) shows the 
following functional modules: 


. System Clock 
. Microprogram Control 
. #Word Memory (Control Store) 
. Am2901A Bit Slice ALU/Register Sets 
. ALU Arithmetic Carry In Control 
. Shift and Rotate Linkages 
Condition Code Control 
. Am2901A Output Bus 

a. Data Output Latches 

b. Address Latches 

c. Memory Management RAM 

d. Condition Code Register 
9. Am2901A Input Bus 
Data Bus input Registers 
Byte Swap Input Registers 
Microword Data Registers 
Clear Byte/Bit Set Logic 
Instruction Decode PROMS 
. Condition Code Register 
10. Interrupt Control 
11. DMA/Retfresh Control 


ONO awn — 
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Sections 8 and 9 are more difficult to isolate on the block diagram 
since they are the buses that connect many function modules 
together. A full detailed schematic of the HEX-29 is shown in 
Figure 14; a fold out drawing at the back of the chapter. A 
discussion of the function of each of the above modules follows. 


System Clock (Figure 15) 


All timing In the HEX-29 CPU is controlled by the system clock. 
The positive going edge of the system clock (LOW-to-HIGH 
transition) marks the end of one machine cycle and the beginning 
of the next. All Input signals to the HEX-29 CPU from the system 
bus are captured on this edge. The next microinstruction is 
clocked into the pipeline register on this edge. 
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Figure 13. System Block Dlagram. 


Normally a system clock is a simpia square wave or more com- 
plex waveform with a fixed period and duty cycle. But the system 
Cock of the HEX-29 CPU is microprogrammed. In other words, 
the period and duty cycle are selected by microword bits in each 
microcycle. The advantage of this approach is one of through-put 
(speed). . 


In any CPU, some internal operations require longer to execute 
reliably than others. And one or more of these operations requires 
the maximum length of time to complete reliably. This is called the 
worst case delay path or “critical path”. Normally the periqd of 
time required to perform this “critical path" operation is chosen as 
the clock period for all instructions. 


Sinca the “critical path” operation may take a factor of 30% to 
100°% longer to execute than typical operations, it is clear that 
Much processor time ts being wasted in any typical program. Two 
mucroword bits are used to control the HEX-29 microprogrammed 
System clock so that each microcycle lasts only as long as 
necessary for the operation being performed. An overall speed 
gain of about 30% to 40% is realized with this technique. This 
was discussed in detail in Chapter II and Chapter SII. 

The master oscillator from which the system clock is derived Is a 
25MHz crystal controlled oscillator. Phase 1 (¢,) of the system 
Clock cycle (Figures 10 and 12) is programmed to be 2, 3, 4or 5 
times the 40ns fundamental penod of the oscillator. The duration 
of d2 of the system clock is 8Ons. Since main memory will rarely 
be as fast as 8Ons access time, a method to allow system mem- 
ory cards to lengthen ¢2 is also provided with the STH bus 
signal. When the STR signal is low, the Am74S161 is disabled 
from counting and the state of the clock will not change until it is 
released and it counts out normally. 


The conventions regarding the system clock are very simple and 
were chosen as the easiest to interface with a variety of memory 
and VO devices. 


All machine cycles begin when the system clock goes HIGH. 
The period of time that the clock remains at a HIGH logic level is 
Called $1. 42 is the period that it is LOW. During all memory 
access (and lO since VO is memory mapped), the processor 
guarantees that all address lines and control bus signals (RW, 
VMA,WP, etc.) are valid and stable at least 20ns before the end 
Of ;. In other words, the CPU must make all bus signals valid at 
least 20ns before ¢2 begins. 


Depending upon the addressing mode being used, the pro- 
cessor will require more or less time to make all necessary 
Signals to the system bus and memory cards valid. 


For example, indexed addressing requires an arithmetic opera- 
tion from the Am2901B’s rather than logical operations or a di- 
rect pass. therefore indexed addressing is bound to take slightly 
longer than immediate, direct, or pointer addressing. 


tis for these indexed operations and some others that ¢; can 
be lengthened in 40ns increments by microword bits ST, and 
STo. So the processor controls the system clock during 4, 0 
meet its requirements. When there is no memory access, the 
minimum 80ns for , is generally more than adequate. Simple 
addressing modes require 80ns-120ns. The most complex ad- 
dressing modes can take 160ng to 190ns using the worst case 
specs for alt IC's.in the, address generation path. 


At the end of ¢, (the beginning of $2), the processor relin- 
Quishes control of the system clock to the memory or LO device 
thatis being accessed. Since UO is mapped into normal memory 
space, there is only one set of timing rules for both memory and 
VO accesses. If no more than 8Ons is required to properly com- 
plete the read or write operation, then 2 will last only 8Ons. But 


the access time of most main memory cards will be greater than 
80ns so a way of increasing the duration of 2 is provided with 
STR bus signals. 


If this signal (STR) is pulled LOW within the first 50ns of d2. $2 
will be lengthened by 40ns for every 40ns that STR is held LOW. 
Thus ¢2 can be extended indefinitely to match the access time 
ol the device being addressed. Naturally this input should be 
driven by open collector outputs so that all cards can share the 
one STR Ime. : 


Though the STR signal is intended to be used during 42 on 
memory reference cycles, it works in an identical fashion during 
¢;. This can be used to advantage by DMA controllers that 
require more than 60ns to generate valid address, data, or con- 
trol signals on transparent OMA cycles. 


Ajumper option on the microprogrammabie system clock allows 
the default period of ¢2 to be increased from 80ns to 120ns on 
memory reference cycles only. This is useful in systems where 
no memory or l/O devices have access times of 8Ons or less, 
and/or when more than 50ns is required to pull STR LOW to 
lengthen #2. Figure 16 is a table of the ¢, and default ¢2 periods 
availab!e with the microprogrammed clock on the HEX-29 CPU. 


Default dz 
Default Period with VMA 
VMA | 6; Pertod| 42 Period | Option Jumpered 
80ns 80ns 80ns 


oo 0 0 42 + + = 
oo #4 9828 8 = = 


Figure 16. Microprogrammed System Clock Timing. 


Microprogram Control 


The microprogram control section (Figure 17) of the HEX-29 CPU 
performs several functions; they are. 


1. System reset and initialization 

2. Interrupt and halt control - 

3. Machine level instruction to microinstruction mapping 
4. Microinstruction sequencing and microsubrautining 
5. Invalid Access Memory Management Trap 


When the system reset button or keyswitch is closed, the input to 
a one-shot is pulled LOW. When it Is released, the rising edge 
triggers a 500 us pulse. This is synchronized with the system by 
gating it through a llip-tlop driver by system clock. The resulting 
signal is used to zero the outputs of the Am2909 microprocessor 
sequencer. Thus, when the one-shot times out. the microprogram 
will begin execution at microaddress 000. The microcode needed 
to initialize the system is stored at this and the following several 
microaddresses and assures the proper system start-up. 


Each time a machine level instruction is fetched, the micro- 
program control logic checks for a hardware interrupt or halt 
Signal from the system bus. if exther signal is active, the micropro- 
gram branches to the appropriate microinstruction address to 
execute the appropriate microcode to service the request. The 
interrupt rout.ne will buffer user registers, switch fo supervisor 
mode. and call a machine fevel routine through a vector table 
element as defined by the priority level of the interrupt. If he hak 
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signa! is puited LOW, the extemal system bus 's released to DMA 
devices or refresh contro'lers unti! the halt bus line is reieased and 
‘he program continues execution. 


When an instructor has been fetchad and there are no .nterrupis 
or haf signals pending. the microprog:am must begin executing 
micto.nslructions ai anew Microaddress. This microaduress 1s a 
funciion of the machine instruction to be execuied. The “map- 
ang” of the machine level instructon into a microaddress ts done 
counesy of the Am27S29 instruction decode PROM s. The op- 
CU 3S placed on the PROM address lines and the micruaddress 
ApNAArs at tire culputs woch are connected to the airect puts to 
the Am290¥s The Am2909’s simply pass this micic.sddress to 
the microword memory by exocuting a Branch lo Addiess on 
drecl -nputs ‘unction, 


Tas. and ail other microprogram sequances operat.ons are 
selected by the outputs of the microprogram bran: PROM which 
1S di'ven by microword bits. This PROM, an AM27S21 contains 
the output combinalions required to execute a var ety of micro- 
progrom control functions including microbrancining, micro- 
subro. ning. ana two-way microbranching ether uccondtionally 
cr nen condibon code b.'s selected by m:crowurd bits. The 
T..clon Code for ths PROM is shewn tn Fig.te 12 

As part of the multi-user. multi-task time siaing capreities the 
HEX-29 CPU provides an invatd meniory acces: tray bv this 
structure, the executve progam cag 48¢:gr af ¥ uNuse tage of 


user memory space as cither non-custen. (ass. 6 oF) OF aS aN 
Address |_____Furciioa | 
| 0 BR © | Qer vartinue 
! ’ 3% 6 = bt Contre 
! 2 | So Qourcom sue 
' 3 i BP V- tor conunue 
4 , GR MN © Ser contro 
} 3 | BR N= fortonteun 
i 3 PR Z = 0 or continue 
H ? BH Z >} 14 or continue | 
4 {®A AH & 0 orcontruc | 
' 9 + BR H- t oF continue 
! A _ BR LZ = Vor continue ! 
H 5 | SH LZ : 1 or continue | 
| c (UR HT = 0 oF continue | 
| n , BROHLT = ior continue 
| E 1; BRIA = Oer caine | 
! F §RoW- 1 of voclirue 
i 100 «1 BR 
i f1 , Neturce 
| 12 | CALL 
13 , Nov esey 
14 CALL N-G 
15 ) Wot used 
eo: ATS 2-1 
17 : Netused 
18 | RE 
| 9 ' Not used 
1A | Notused 
} tb | tol used 
} 1C Vet woe | 
' 7 | Net used i 
| iE | BRNAP Id - GerBR { 
{ iF CONTINUE | 
[oneness Reel eine 


Figure 18. tMicroprogiara Sequencer Brunch Code. 


inverd access area. If any user instruction attempts to access 
Memory in a page that has been assigned as an invalid access 
Page, the microprogram control logic takes action. 


Before the current machine cycle completes, the next instruction 
address 1s forced to the highest value in the current 512-word 
muicroword dlock using the Am2909 OR inputs. At this point a 
miczesrancn to the invalid access trap microroutine is performed. 
The inv ald access !s processed just like another (highest) level of 
hardware vectored interrup! except that the current machine level 
instruction does not complete before the microprogram recog- 
nizes and acts upon the condition. 


MICROWORD MEMORY 


Any number of memory device types could have been chosen for 
the microword memory in the HEX-29 CPU. RAM has the ad- 
vantage that it is dynamically alterable, but if this feature is utilized 
much more hardware support would ‘have been necessary and 
the overall cost increased significantly. Besides, the effect of 
writable control store can be simulated with fixed memory devices 
by microcode bank switching at much lower cost and complenity if 
the feature is desirable. For development of new microcode 
toutines, RAM wntable control store in the address space of 
another computer system offers many advantages. This is par- 
ticularly true if the other computer happens to support a micro- 
assembler and file management system as does the System 29." 


Trough EROM's and EAROM's 1Ve also viable microword mem- 
ory cavices for microcode development, they are much too slow 
to make elficient use of the rest of the high speed micro- 
programmed processor in the production device. 


Fuse-tint 'npu.ar PROM's are the only viable microword memory 
dey ces for procucthicn systems tor a variety of reasons. They are 
very fast, (45ns maximum access on the HEX-29 CPU), small 
i012 § Bin 20 pins} tess expensive than fast RAM, and more 
sicaibie than a mask HOM would be. Itis a simple matter to alter or 
eatend tha microprogram ot commercial systems wilh fuse-link 
PROM micrcword memory. 


As mentioned, the microword memory of the HEX-29 is com- 
posed of AM27S29 512 x 8 Iuse-link PROM’s and is shown in 
Figure 19. These space etticient 20 pin parts have worst case 
access times of 45s over the commercial temperature and 
veitage range Up tu 4k of microwo:d memory can be addressed 
by the set of three Am2909 microprogram sequencers on the 
CPU card. Space for up to 2k of microwo-d memory PROM's Is 
available on the HEX-29 GPU card. Though a perfectly 
adequate instruction set can be coded in fess than 512-words of 
microword memory, the HEX-29 has a very extensive high level 
instruction set incluo ng 16 and 32-bit integer and 64-bit flouting 
point ADD, SUB, MUL. DIV, CMP, and extensive buffering in- 
structions. in addition to the extremely complete numeric pro- 
cessing package, numerous nivble, character, byte, and word 
macroinstructions are implemented with scans, linked and un- 
linked searches, block moves, and etc. A slack processor is a 
subset of this more than complete instruction set. For ail of the 
capabilities of the HEX 29 CPU, less than 1.5k of microword 
memory was required. Thus, more than 0.5k of space remains 
for future expansion by the user before a larger PC card is 
needed (extramely unhkely). 


Connections ler the microword data. address and select lines are 
availabie at cor:iectors at the top of the HEX-29 CPU cai. Thus, 
itis quite straigh tlo-y.ava to support off-board micraward memory. 


*Sytem 2915 a deviloninem oy Jtem for micruprogrammed systems available from 
Advarcud Mts Gk tue. 


CPERESROPEES | 


ETS E ES | 


Q9595252 


H1372] 


5s 


Ssssasas 


3883 


mn I 


| 
| 


iateateelestateteieiedente 


efke tt 


cm Bmrennre O8 


cm amrennre ce 


my 


ce ewes 


die 


ME Maw temtwanc 


araocusy 


9000 


eee | 


4 


a 


* er 


MOR 668 


Figure 19. 


28 


9 


290 


itis even perfectly reasonable to use an off-board writable control 
store with up to 2k of microword RAM concurrently with up to 2k of 
PROM resident on the PC card. 


It the on board PROM contains an instruction set, it is then a 
simple matter to use the off board writable control store to develop 
New microcode for the machine interactively on the one HEX-29 
system! 


The outputs of the microword memory devices are attached to the 
inputs of Am74S374 registers. These registers are called the 
pipeline registers since they allow the fetching of the next micro- 
instruction concurrently with execution of the current one. Clock- 
ing of the pipeline registers occurs on the LOW-to-HIGH transl- 
tion of the system clock. The outputs of the pipeline registers are 
the 64 microword (or pipeline) bits that control every aspect of the 
processor. 


These 64 bits can be logically grouped into several functional 
felds a follows: 


. Microword Data’Microbranch Address and Control 
ALU Source Select ~ 
ALU Destnation Select 

ALU Function Select 

ALU Camry In Select 

. Shift Linkage Select 

. ALU A and B Specifications 

. A and B Fields Select 

. Enable onto ALU inputs Select 

10. Latch External Data Inputs 

11. Latch CPU Outputs 

12. Control Bus Signals 

13. Microprogrammed Clock 

14. Condition Code Controts 

15. Enable Interrupt Circuitry 

16. Memory Map Control 


OMPNAMSAON~ 


Notice that with the exception of the microword data and micro- 
branch address and control fields, no other fields are overlapped. 
This is a ‘horizontally’ structured design. Overlapping several 
fields leads to ‘vertically’ structured systems. This latter class of 
macnines can save some microword memory, but only at the 
expense of through-put and increased hardware complexity. Now 
that the cost of the PROM’'s has come down significantly, the 
Savings accrued from using a vertically structured design ap- 
proach is generally insignificant when compared with the overall 
system cost. 


A summary of the functions of the microword bits is shown In 
Figure 20, 


4m2901B ALU/REGISTER SETS 


The heart of the HEX-29 CPU is the set of four Am2901B bit slice 
ALU'Register Sets depicted in Figure 21. All arithmetic and logi- 
cal operations are performed in these bipolar LS! IC's, including 
address generation. The user accessible set of 16 registers and 
routing functions are also intemal to these remarkable and ex- 
tremely versatile chips. 


The operation of these units, though very elegant and com- 
prehendible, 1s too lengthy to include here and the user is referred 
to the Am2900 Family Data Book by AMD. 


Carry lookahead is accomplished by the Am2901B’s and an 
external IC, the Am2902A. Shift control 1s partally within the 
Am2901B’s and is supported by other external circuitry to be 
discussed later. 


A summary sheet of the Am2901B ALU functions appears on 
Page 29 but should be supplemented by studying the AMD 
literature already mentioned. A good supplement is the AMD 
Schottky and Low Power Schottky Handbook. 


The A and B input fields to the Am2901B's are multiplexed by 4 
Am74S253's in the following four ways. 


Am2901B A Inputs 


pword Memory 

Upper Nibble ABL 
Lower Nibble ABL Lower Nibble ABL 
Upper Nibbie ABL Lower Nibble ABL 


ABL = A,B Latch (On data bus bits 27-20.) 


Am2901B B Inputs 


uword Memory 
Upper Nibble ABL 


CARRY IN CONTROL 


The arithmetic cary-in (Cy) signal (Figure 22) to the Am2901B bit 
slices can be selected from four sources as follows: 

1. Logic O (No carry-in add instruction, borrow in subtract 
. instruction.) 

2. Logic 1 (Carry-inin add instruction, no borrow in subtract.) 

3. Carry Flag (C bit in condition code register.) 

4. Q Shift Bit (Double length shifts.) 


Note that the natural state of the Carry Flag output from the 
Am2901B is 1 for carry on add, 0 for no carry on add, 1 for no 
borrow on subtract, and 0 for borrow on subtract. This convention 
has been maintained in the condition code and carry in logic. 
Some other machines operate differently with respect to this 
convention, but others do not and the HEX-29 maintains the 
faster convention for lack of a good reason to alter it. Some 
programmers will be required to remember this convention while 
others will be used to it. 


SHIFT AND ROTATE LINKAGE 


The shift and rotate linkage (Figure 23a) of the HEX-29 is com- 
posed of an Am74S253 and an Am74LS125 plus the intemal shift 
contro! structure of the Am2901B's. The functions that can be 
performed by this circuitry are shown in Figure 23b. 


The solid lines in Figure 23b delineate the basic shift linkages. 
The dotted lines are optional linkages which can also oe enabied. 
With these linkages, ali of the normal shifts and rotates can be 
performed plus a number of double word shifts including special 
shifts for high speed multiplies and divides. 


CONDITION CODE CONTROL 


The condition code register shown in Figure 24 of the HEX-29 has 
eight flags. The definitions and placement of these flags are 
delined in Figure 25. 


In addition to the very useful and fairly common C, V, N, Z flags, a 
half sign is provided for easier byte processing. The three user 
flags are not changed by any of the normal arithmetic or logical 
operations. However, they can be read by the processor and 
written by the processor with special instructions such as load 
flags, read flags, set bits in flags, clear bits in flags, invert bits in 
flags. The tact that none of the user flags is changed by any but 
this type of special routine is very significant. It means that various 
routines and program segments can pass flags back and forth 
freely without fear of modification or restriction on the instructions 
that can be executed. Reading the condition code flags into the 
processor, orbranchingors broutining upon combinations of bits 
set or Clear does not alter the flags. 


11. 


BRCI 
BRC2 
BRC3 
BRC4 


BRCO | 


Function 


Microprogram 
Branch 


Address 2 
Microword data to Intemal 


data bus to Am29018's 


Microprogram 
Branch 
Control 


Latch in low nibble of data bus 
Control Bits A 
Shift and Rotate MUX 


Am2901 Source Select Code 
Carry-In MUX Select Bit A 
Am2901 ALU Function Code 
Carry-In MUX Select Bit B 
Am2901 Destination Code 


Fetch Instruction this cycle 


Am2901 'S' field register specification 


Am2901 ‘A’ field register specification 


A, B fields MUX Select Bit 0 (A, B fields on Am2901) 
A, 8 fields MUX Select Bit 1 (A, B fields on Am2901) 
Microprogrammed system clock stretch bit 0 
Microprogrammed system clock stretch bit 1 

Latch Data In — Both Swapped and Unswapped 

Latch N, Z, H flags - MUX Select 

Latch C, V flags — MUX Select 

Latch Condition Codes -— U2, U1, UO. H, Z, N, V, C MUX Select 
Read Condition Codes onto intemal bus, low byte 
Select Microword bits 15-0 to internal bus, else branch code 
Data In Low Byte Enable onto internal bus 

Data In High Byte Enable onto intemal bus 

Swapped Data In Low Byte Enabled onto internal bus 
Swapped Data In High Byte Enabled onto internal bus 
Clear Low Byte on intemal bus — Bit Set Enable HIGH 
Ciear High Byte on interna! bus — Bit Set Enable LOW 
Load Memory Map — Write into Memory Map RAM 
Read Memory Map — Enable Memory Map to data bus 
Latch Address — Enable Transparent Address Latch 
Bus Available — Busses available for OMA this cycle 
REad/Write Memory (Write if iow) 

Valid Memory Address (Read or Wnte) this cycle 
Sync. DMA — Ac've cycie be‘cre ius "\ available 
interrupt Logic ©° able 


Figure 2 4. Mi.coword Bus. 
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Figure 20B. Am2909 Microprogram Branch Control, Figure 20C. Condition Code Manipulation, 
. Bits 12-16. Bits 45-47. 


a aes KS 
A+i1 D+Avr!t [O+Q4+1 O+1 

0+Q+C 

Q-O0-1j- 


RIGHT Fi2-B Q2--Q 


RIGHT F2—-B8 


Figure 20E. Am2901 Destination Codes, Figure 20F. Am2901 A, B Field Selects, 
Bits 28-30. Bits 40-41. 


Figure 20G. Shift & Rotate Control, Figure 20H. Microprogrammed System 
Bits 18-19. Clock Stretch, Bits 42-43. 


Figure 20 (Cont.). 
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Carry Flag (Arithmetic) 


Overfiow (Two's Complement Arithmetic) 


Negative (Most significant bit = 1) 
-1) 


for N 
Zero - (Z = Vil Bits in 
word 


Halt Sign - (H <1 If Di,.7-1) 


(Note: Not half carry) 
User Flag 0 - (Can be set/reset! 
Tested by user onty) 
User Flag 1 - (Can be set/reset/ 
tested by user onty) 


User Flag 2 - (Can be set.reset/ 
tested by user only) 


Figure 25. 


Eight condition code operations provide all the useful operations 
needed for complete flexibility. They are shown in Figure 26a and 
26b in two different formats. Note that the codes are grouped into 
three categories: anthmetic (C and V), logicaVarithmetic (N, Z, H) 


_ and user (U2, U,, Up). 


These eight conditions include all the necessary and desirable 
features such as updating only the shift cary bit and the ability to 
do operations that read, operate on, and reload the condition 
code register all in one machine cycle (160ns). Also, a feature of 
immense importance where microcoded floating point or fixed 
point math is concemedis the ability to update tags onacycle by 
cycle basis! An unusual feature. 


Carry/ | Negative/Zero/ User Flags 
Overflow C, V Half N, 2, H U2, U1, UO 


7 Shift Bt CV V No Change No Change 
Shift Bt C.V Vv Load From Bus Load From Bus 
Load From Bus No Change No Change 
Load From Bus Load From Bus Load From Bus 
No Change No Change No Change 
No Changa Update No Change 
Update No Change No Change 


Update Update 
*Less uselut than other codes but perfectly legal. 
Figure 26A. 


No Change 


Shift MSb or LSb into C 
*Shitt nto C — 

Bus Load Rest 
*Bus Load C & V Flags 


Bus Load All Flags 

No Changes 

Update N, Z, H Flags 

Update C and V Flags 
| Update C, V,N,Z,H 


p» = updated, NC = unchanged, B = loaded from mterna! bus, 
S - Shit Br 


Figure 26B. 


Am29018 OUTPUT BUS 


Being a highly structured, modular device, ‘he HEX-29 CPU is 
very bus oriented. The output bus of the Am2901B's generate the 
addresses and data to the rest of the system devices as well as 
some internal function. The four logical units on this bus (shown in 
Figure 27) are: 


1. Address Out Latches — (System Aadieas bus) 

2. Data Out Latches~ (System data bus) 

3. Memory Map/Latches— (Memory Management Features) 
4. Condition Code MUX — (For updating flags from processor) 


Any memory reference requires that an address be valid on the 

system address bus. The source of this address is generally one 

of the Am29018B internal registers or modifications thereof from . 
previous fetch cycles (such as indexed addressing). 


On a write cycle, data must be placed on the system data bus. 
This is accomplished in the same manner as address generation 
except that a different microword bit is used to activate the data 
latches. 


In a multi-user/multi-task/timesharing environment, itis desirable 
to have a powerful memory management scheme. The HEX-29 
CPU implements this via a flexible memory mapping system 
where the upper four bits of the 16-bit address generated by the 
Am2901B's are ‘mapped’ into seven address bits and a wnte 
protect bit. Invalid access traps and one Megabyte address space 
are integral features of this system. The loading of this MAP RAM 
(2Am29701's) is 2! 2 accomplished via the Am2901B output bus. 


Another imeurtant characteristic of the HEX-29 CPU is its ability 
toread, write, test and operate upon the eight condition code flags 
in the byte form. All eight flags can be written to by the Am2901B's 
at one time, in one microcycle. This is very useful for many flag 
operations and is absolutely necessary for efficient updating of 
the user flags for interroutine parameter and condition passing. 


The logic of these bussed systems Is quite simple. A separate 
microword bit or bit field is used to cause each of these logical 
units on the bus to accept the data bus. Therefore, simple micro- 
programming techniques are applicable to this busing approach. 


Am2901B INPUT BUS 


Much of the power and modularity of the HEX-29 design is due to 
the highly structured bus approach on the Am2901B Data Inputs. 
The logical units that can drive this bus (Figure 28) are listed 
below: 


1. Data input Registers 

2. Swap Input Registers 

3. Microword Data Registers 

4. Clear Upper Byte/Clear Lower Byte/Bit Op Logic 
5. Condition Code Register 


Data input from the system bus Is captured in the data input 
registers and the swap input registers. The data input registers 
bring the upper and lower bytes of the data bus to the corre- 
sponding bytes in the Am2901B cascade while the swap input 
registers switch the upper byte of the data bus to the lower byte on 
the Am2901B cascade and the lower byte of the data bus to the 
upper byte of the Am2901B cascade. 


Additionally, logic to set all bits in the upper or lower byte to zeros, 
(clear upper byte and clear lower byte), allow selecting arithmetic 
or logical zeros in either byte field. If the bit set option is enabled. 
ail bits are pulled low except the one selected by the hexadecimal 
value in the low nibble of the nibble latch from an instruction or 
other data source. 


All eight condition code bits can be enabled onto the low byle if 
desired. All flags can thus be sampled by the Am2901B's atonce. 
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Data from microword memory from three-state registers In par- 
allel with the pipeline register can be enabled onto the upper 
and lower bytes for direct loading of the Am2901B's from 
microprograms. 


Inthe absence of any device being enabled onto a particular byte 
on this bus, it will be pulled up into a logic 1 state. This can be 
useful for masking in logical operations and filling or biasing in 
arithmetic operations. : 


An important factor in the flexibility of this approach on the 
HEX-29 is that the upper and lower bytes of the data in registers, 
Swap in registers, and the clear upper/lower byte logic are sepa- 
falely enabled. Also, the condition code register only drives the 
lower byte and the pull-up feature will operate on either byte 
individually. Thus the upper and lower bytes cen be individually 
daven on a ‘mix and match’ basis from several sources. 


The versatility so generated allows numerous fast processing 
modes. See Table 5 for a list of all of the possible combinations of 
high and low byte inputs to the 2901B’s. 


TABLE 5. 
Into Upper Byte into Lower Byte 
of Am2901's of Am2901's 


Mecroword memory bits P15-P8 | Microword memory bits P7-PO 


0. 
1. 
2 
3 
4 
5. 
6 
?. 
8 
9 


Ba set value (upper byte) 
Upper byte - data bus 
Upoer byte - data bus 
Upper byte - aata bus 
Upper byte - data bus 
Upper byte - data bus 
Lower byte - data bus 


. Lower byte - data bus 
. Lower byte — data bus 


Lower byte - data bus 


. Lower byte — data bus 


Ba set value (lower byte) 
Lowor byte - dala bus 
Upper byta — data bus 
Glear lower byle 

All high generator 
Cond:bon code regster 
Lower byte — data Sus 
Upper byte - data tus 
Clear lower byte 

All high generator 
Condibon coda register 


All high generator 
. All tagh génerator 
. All high generator 
. Al hgh generator 
. Ali hgh generator 
. Clear upper byte 
. Clear upper byte 
. Clear upper byte 
. Clear upper byte 

Clear upper byte 


Lower byte — dala bus 
Upper byte - duta bus 
Clear lower byio 

All ugh generator 
Condition cod" register 
Lower byte - data bus 
Upperbr data bus 
All high ;-nerator 
Conditon code regrster 
Clwar tower byte 


*Note: Interestingly enough thrs is the only case 'n the entre table 
that the hardware CANNOT genorate on the bus, but IS the 
ONLY one of these codes that CAN be generated by the AMD 
29018 slices! (How convenient!) 


Examples of uses for some of these modos include: 


1. Clearing upper byte for 8-bit index offset 

2. Fast bil seVclear/tesvinvert operations 

3. Set upper byte high to AND lower byte with upper byte 
change 

. Clear upper byte to AND off upper byte and operate lower 

. Upper byte of data bus to lower byte for allbyte ops on upper 
byte 


ans 


6. Load defined values from microcode for tamper-proof con- 
Stants, vectors, etc. 

7. Normal data input or address input without swap or modifi- 
cation. . 

8. Clear upper byte and data in fow-bite immediate ops, etc. 


INTERRUPT CONTROL 


The powerful maskable priority vectored interrupt system (Figure 
29) of the HEX-29 is a direct derivative of the incredible Am2914 
bipolar LSI interrupt control IC. This circuit is so well mtegrated 
that it uses only one microword bit and requires very litue support 
Circuitry. The general set of operations that can be executed by 
the Am2914 is shown below. For more detailed informabon on this 
chip see the Am2900 Family Data Book. 


F. Enable Request 

Load Mask Register 

. Disable Request 

. Clear Mask Register 

. Bit Set Mask Register 

Bit Clear Mask Register 

. Load Status Register 

Set Mask Register 

REad Mask Register 

Read Status Register 

Read Vector 

. Clear Interrupts Last Vector Read. 
. Clear Interrupts via M Register 
Czar Interrupts via M Bus 

. Clear ail Irienupts 

. Master Ciear 


SANYO DUDN@BO>DOOM 


Flow chars cf the acbons taken in mcrocode by the HEA-29 CPU 
are shown in Figure 30 and Figure 31. 


OMA CONTROL 

The OME. structure is quite straightforward. There are e-gtt ac- 
tive-LOW DMA request lines and eight correspond -g DMA 
acknowledge ines. The h:ghest priority requesting 3a GivA vycie 
al the beginning of the microcycle vefore DMA wii ke alicwed 
gets an acknewledge signal that lasts up until the DMA cycle — at 
feast. 

If no devines are requesi:ng DMA, the NAQ (no request) bus 
signal goes LOW. This is an excellent opportunity for dynamic 
RAM circuitry to refresh sequential rows on each DMA cycie that 
NRO ts LOW. 

Another input signal ODMA, will overnde ail priontes and not 
acknowledoe any level of DMA request This could ba used by 
dynamic RAM refresh circuilry when it must be pemitted to 
refresh :tsell soon or chance losing data. 


Many schemes of DMA handling can be accomplished with this 
simple and uncomplicated priority controlied system. An 
Am74S374 captures the DMA requests (Figure 32} ona cycte by 
cycle vucis. An Am2913 prioritizes these requests ano acKrcw- 
tedges the highest level request with a three-bit binary code. An 
Am74S138 expands this to the eight bits of DMA acknowledge 
that correspond to the eigh! input bits. The Am2913 supphes the 
NRQ bus signa! and provides for tha DDMA bus signal. 
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Figure 29. 
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CURRENTLY 
IN SUPERVISOR 
MODE 


, 


DISABLE ALL LEVELS OF INTERRUPTS 


JUMP INDIRECT ON 
INTERRUPT VECTOR 


EXECUTION OF INTERRUPT 
ROUTINE IN SUPERVISOR, 
SUPERVISOR MAY RE-ENABLE 
INTERRUPTS IF DESIRED. AT 
END OF INTERRUPT PROCESSING 
ROUTINE, THE SUPERVISOR 
RETURNS CONTROL TO USER 
PROGRAM WITH AN 
ATIINSTRUCTION. 


Figure 30. 
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CURRENTLY 
IN SUPERVISOR 


MLLEGAL ATTEMPT OF 
USER TO EXECUTE A 
PROTECTED INSTRUCTION, 
NO OPERATION 


= 0? (le. RETURNING 
TO SUPERVISOR?) 


EXECUTE NEXT 
SUPERVISOR INSTRUCTION 


Figure 31. 
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SYSTEM BUS INTERFACE EXAMPLE 
HEX-64KBS STATIC MEMORY CARD 


It was possible to design the system bus to be very simple to work 
with because fhe HEX-29 is a microprogrammed device. The 
foitowing section discusses an implementation of a 64k byte static 
memory card for the HEX-29 system bus using Am9124 memory 
ICs. The purpose is to show that des'gning cards that interface 
with the HEX-29 system bus !s relatively easy. Note that a design 
for VO devices would be similar to this implementation since I/O 
devices are memory mapped and share exactly the same set of 
bus signals and timing requirements. 


Starting trom the left hand of the schematic shown in Figure 33, 
we find that tha low 13 bits of the address bus and the four control 
bus signals (CLK, VMA, R/W, and WP) are buHered from the 
system bus by two Am74S240 ICs and three sections of an 
Am74S244. These are inverting and non-inverting buffers re- 
spectively, and olfer extremely high current drive (64mA sink 
current) and very high speed (~4 to 6ns) with only very light bus 
loading (400A low level). 


Ten of the address lines buffered by these ICs then drive the 
address tines of half of the memory array through series type 
termination resistors. These resistors (~33 ohms) serve to pre- 
vent undershooting zero volts by more than the permissible 0.5V 
on negative edge transitions of the address lines. This type of 
termination has the advantage that it does not draw current from 
the driver ICs; it Is highly recommended over split termination for 
memory arrays where current toading is negligible, but capacitive 
loading is significant. Note that to further reduce these capacitive 
loading effects, the address lines of only half of the memory array 
are driven by one set of buffers. (Find the second set of 
Am74S240 address buffers at the far right of the schematic.) 


The remaining 3 address lines that were buffered by the 
Am74S240s drive the A, B, and C inputs of (4) Am74S138 
one-ol-eight decoders. These ICs develop the 32 1k word chip 
selects that enable the appropriate Am9124 memory ICs for read 
and wnite operations when they are addressed. 


Of course only one of the Am74S138 ICs should be enabled when 
the board is addressed. This is a function of the higher address 
lines, A18-A13. Since each Am74S138 is able to select 1k word 
blocks of memory, each Am74S138 should be addressabie on 8k 
word boundaries. Decoding the upper address lines (A18-A13) to 
match selectable 8k boundary addresses is accomplished with 
four Am25LS2521 8-bit equal to comparators, one for each 
Am74S138. 


The OIP switches on the right hand side of each Am25LS2521 
detine the conditions under which the corresponding Am74S138 
will be selected. When the eight inputs on the left hand side of 
these chips correspond to the values set on the DIP switch on the 


right, the Am74S138 Is enabled. Note that the VMA bus signal 
(Valid Memory Access) must be LOW to enable the 
Am25LS2521. Also note that each 8k word bank can be uncondi- 
tionally removed from the system memory space by leaving the 
lowest DiP switch open. Thus the board may be filled in 6k word 
increments if desired. 


The Am74S138 ICs are also enabled by the system clock via the 
CLK signal. Therefore, memory chip selects can only occur dur- 
ing the time that the system clock is LOW (called #2). The impor- 
tance of this will be discussed shortly. Another signal that must be 
valid for these ICs to be enabled is the DIS Signal. Wherever the 
R/W signal is LOW (indicating a write) and WP (write protect) is 
HIGH (protect the memory), then the DIS signal is brought LOW. 
This disables the Am74S138's and blocks the selecting of any 
memory ICs, thereby write protecting all on-board memory. 


Above the memory array on the schematic are the data bus 
buffers, one set for each half. Again, this is done to reduce 
capacitive loading, this time on the data lines. Am74S373 octal 
tri-state latches are used for all eight of these data buffers. The 
enable inputs are driven by the inversion of the system clock bus 
signal so that they are transparent during all of é2, which is when 
the data is transferred. The appropriate Am74S373 latches are 
turned on (OE LOW) during read and write signals so that the data 
is buffered in the proper direction. 


The Am26S02 one-shot is used to stretch d2 of the system clock 
to meet the access time of the memory. Without this signal, 2 
would last only 80ns and the access time specifications of the 


. Am9124 memory ICs would not be met. The Am26S02 is acti- 


vated whenever memory ICs on the board are addressed when 
the system clock enters d2 (negative edge). Once fired, the 
duration of 2 Is stretched by 40ns for every 40ns that the STR 
bus signal is held LOW. Since the Am9124 EPC memory de- 
vices have an access time of 200ns worst case, 2 must be 
stretched by 120ns. 


Summary 


As can be seen, the HEX-29 16-bit design represents a simple, 
straightforward design approach to building a high-performance 
16-bit processor. This design takes advantage of many of the 
features of the Am2901 and Am2909. The instruction set shown 
in this application note is intended to be representative of the 
more common types of instructions to be executed on a machine 
of this class. In addition, microcode could be developed to exe- 
cute a great many additional instructions as well as other classes 
of instruction such as entire floating point package. This design 
utilizes microprogram control throughout, and is a good demon- 
stration of parallel microprogramming in a most straightforward 
application. 


AMD wishes to thank Mr. Mike Simmons and Mr. Lee McDonald 


of HEX for their work on this invited paper as a part of this 
application note series. 


APPENDIX 


HEX-29 Microcode 


This appendix contains 256 words of HEX-29 microcode. The first 
partis a definition file which defines the HEX-29 hardware struc- 
ture for the AMDASM™ assembier. The various inputs to the 
Am2901 are defined via equates while all other microword fields 
are literally detined. The second part is the assembly file which 
symbolically, via terms defined in the definition phase, constructs 
each microword. Each microword begins with an optional label 
(such as RESET:). Next is the Am2909 branch control field, 
followed by all of the remaining control fields. This structure gives 
the appearance of a conventional assembler, i.e., LABEL, OP- 
ERATION, OPERANDS. A microinstruction which has no 


Am2909 branch control specified, such as microwords 3 and 4, 
uses microword bits 0-15 (which includes the branch control field) 
to place immediate data directly on the intemal Am290t bus. The 
Am2909 is then forced to “CONTINUE” by the “LIN” field. LIN, 
besides Latching IN the data on the Am2901 bus. disables the 
microprogram branch control register output, causing the “CON- 
TINUE” function to be selected in the branch control PROM (see 
Figure 208). 


These 256 microwords represent a reasonable subset of the 
HEX-29 standard instructions, i.e.. branch, conditional branch, 
data moves (MDV), and, or, add, sub, etc. 
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Super Six 


INTRODUCTION 


The AMD 16-Bit Computer design is an example of a high-speed 
microprocessor system which takes full advantage of AMO's 
Am2900 Family of Bipolar microprocessor circuits to provide an 
economical, high performance, self contained 16-bit computer. 
was designed to demonstrate the principles of a micropro- 
grammed system. 


Ths design is intended to show some of the techniques used to 
achieve high performance. This includes pipelining at the micro- 
Program level as well as pipelining at the macro or machine 
struction program level. A powerlul instruction set is demon- 
Strated which allows the user to write efficient programs in a 
minimum amount of time. 


One of the unique features of the design is that in addition to using 
the high performance Am2900 Bipolar microprocessor family, it 
lakes advantage of the MOS peripherals normaily associated 
wth MOS microprocessors. These are used to perform the 
slower functions, Panticularly in the VO interface area. 


SYSTEM ORGANIZATION 


The 16-Bit Computer is designed to perform in a system environ- 
ment as shown in Figure 1. The system consists of a central 
Processing unit (the 16-Bit Computer), memory units, /O units 
(Penpheral controtiers), and a bus controller. These units com- 
municate over the system bus consisting of a 16-bit wide address 
bus. 16-bit wide bi-directional data bus, and a control bus. The 
Control bus is a collection of signals that include the memory and 
LO interface controls and the interrupt request lines. 


This organization allows systems to be configured with more than 
one CPU and multipie memory and VO units. The bus controller 
arbitrates requests for bus use from the CPU's or /O units that 
require DMA transfers. 


This application note concentrates on the design of the CPU 
portion of the system. 


INSTRUCTIONS 


An instruction ts either one or two 16-bit words in length and must 
be located in main memory on an integral word boundary. The lett 
most eight bits of the instruction is always the operation code, 
followed by two, 4-bit register des:gnation fields (Figure 2). The 
16-bit (one word) instruction is always this format. The 32-bit (two 
words) instruction has the first (left most) word exacily like the 
16-bit instruction. The second word of the 32-bit instruction is 
always fut! 16-bit value (d) which acts as a memory reference 
address or an immediate value (Figure 3). This architecturally 
simple instruction format becomes very powerlul when im- 
plemented on a microprogrammed machine. 


The 8-bit cpcode provides for 256 primary instructions. which is 
usually more than enough for most general purpose computers. 
The 4-bit register fields (R, and R2) each designate one of the 
sixteen, 16-bit registers (Ro-Riy5). Depending upon the operation, 
each register can act as either an accumulator for anthmelic and 
logic operations, or an index register in modulo address anthme- 
tic. On operations where the result is placed in a register, the Ry 
field depicts the destination register and R2 (or R2+ 4) Is, or points 
to the source field in main memory. On operations where the 


Figure 1. System Organization. 


resultis transferred from a register to memory. the R, field depicts 
the source register and R2 (or R2+d) points to the destination 
memory location. Memory to memory transfers will have R, as 
the source pointer and R, as the destination pointer. Even though 
the R, and Rp fields are architecturally wired to the Am2903 
register address inputs, vanations of the source/destination as- 
signment may be implemented via microcode. 


The complete defined standard instruction set is given in Table 1. 
This is a typical “machine level” instruction set. It allows mampu- 


Figure 3. 32-Blt instruction (RX, RSI). 
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Table 1. 16-6it Computer Instruction Summary Mnemonic Instruction Format. 


FIXED-POINT LOAD/STORE INSTRUCTIONS 


LO LOAD 
st STORE 


RR, RS, SS, RX, RSI 
RS, AX 


FIXED-POINT ARITHMETIC INSTRUCTIONS 


ADD ADD 
ADD WITH CARRY 
SUBTRACT 
SUBTRACT WITH 


TEST IMMEDIATE 
COMPARE 
COMPARE LOGICAL 
MULTIPLY 

* MULTIPLY UNSIGNED 
DIVIDE 
ONES COMPLEMENT 


BYTE INSTRUCTIONS 


LOAD BYTE 

INSERT CHARACTER 
STORE BYTE 
EXCHANGE 

BYTE SWAP 
COMPARE LOGICAL 


SYSTEM INSTRUCTIONS 


LPSW LOAD PROGRAM 
STATUS WORD 

STORE PROGRAM 
STATUS WORD 

EXCHANGE PROGRAM 
STATUS WORD 

SVC —- SUPERVISOR CALL 

SETP SET BIT PSW 

ASTP RESET BIT PSW 

TSTP ‘TEST BIT PSW. 

CMPP COMPLEMENT BIT PSW’‘ 


STACK INSTRUCTIONS 


BRANCH AND STACK 

RETURN 

PUSH 

POP 

PARTIAL PUSIt 

PARTIAL POP 

LOAD STACK POINTER 

LOAD STACK LOWER 
LIMIT 

LOAD STACK UPPER 
LIMIT 

STORE STACK POINTER 

STORE STACK LOWER 
LIMIT 

STORE STACK UPPER 
LIMIT 


SPSW 


EPSW 


RR, RS, SS, RX, RSI 
RR, AX 
AR, AS, SS, RX, ASI 


RR, RX 

RR, RS, SS, AX, RSI 
AR, RS, SS, RX, AS! 
RR, RS, SS, RX, RSI 
RSI 

RR, RS, SS, RX, RSI 
RR, RS, SS, RX, RSI 
RR, RX 

AR, RX 

RR, RX 

RR, AS, SS, RX, RS! 


RR, RX, ASI 
RR, RX, RSI 
RR, RX, ASI 
RR, RX, RS# 
RR, AX 


RR, RS, RX, RSI 
RR, RS, AX, RSI 
RR, RS, RX, RS! 
RR, RS, RX, AS! 


EXTENDED INSTRUCTIONS 


TRANSLATE 
TRANSLATE AND TEST 
MOVE LONG 
COMPARE LONG 
EXECUTE 

DECIMAL ADD 
DECIMAL SUBTRACT 
OECREMENT INDEXES 


SHIFT/ROTATE 


SHIFT RIGHT LOGICAL 

SHIFT RIGHT 
ARITHMETIC. 

ROTATE RIGHT 

SHIFT LEFT LOGICAL 

ROTATE LEFT 

SHIFT RIGHT DOUBLE 
LOGICAL 

SHIFT RIGHT DOUBLE 
ARITHMETIC 

SHIFT LEFT DOUBLE 
LOGICAL 

SHIFT LEFT DOUBLE 
ARITHMETIC 

ROTATE RIGHT DOUBLE 


ROTATE LEFT DOUBLE RX, RSI 


UO INSTRUCTIONS = 


IN 
INB 
OUT 
OUTB 


RR, RX 
RR, AX 
RR, AX 
RR, RX 


INPUT WORD 
INPUT BYTE 
OUTPUT WORD 
OUTPUT BYTE 


BRANCHES 


BR 


BC 


UNCONDITIONAL 
BRANCH 

UNCONDITIONAL 
BRANCH REGISTER 

BRANCH ON CONDITION 
TRUE 

BRANCH AND LINK 

BRANCH AND LINK 
REGISTER 

BRANCH ON INDEX HIGH 

BRANCH ON INDEX LOW 
OR EQUAL 


lation of bit, byte, word and multibyte data; PUSH/POP single or 
multiple registers to/from stacks; maintain multiple stacks; deci- 
mal, binary and integer arithmetic; byte and word W/O; and 
Maintain supervisory control over hardware and software ¢n- 
erated interrupts. 


Instruction Format 


Many of the instructions have multiple formats. These formats 
depict addressing modes and determine where the source and 
destination fields are located. The defined instruction tormats are 
Shown in Figure 4. 


” Figure 4. Instruction Formats. 


The instructions set consists of nine instruction groups: 


- Fixed-point load/store 
~ Fixed-point anthmetic 
- Byte 

~ Shifvrotate 

— Branch control 

- 0 

~ Stack 

- Extended 

- System 


Acomplete description of each instruction is given in Appendix A. 


CENTRAL PROCESSING UNIT ARCHITECTURE 


Processor Organization 


The organization of the computer is shown in Figure 5 (Computer 
Block Diagram). The computer is organized into several distinct 
sections, the Program Control Unit (PCU), the Arithmetic and 
Logic Unit (ALU), and the Computer Control Unit (CCU), the Data 
Path, the Memory Control and Clock Control, and Inpul/Output 
Interface and Interrupt Section. The logic diagrams for the CPU 
are located in Appendix F. Earlier chapters in the Build a Micro- 
compuler series have described the principle sections of acom- 
Butler and the Am2900 components used in these sections. This 
chapter descnbes how these components are used to implement 
4 very high-speed low cost computer. 


Note: Figure 5 is sheet 1 of the logic diagrams. 


The Program Control Unit 


The Program Control Unit (PCU) under control of the micropro- 
gram is used to update the Program Counter and load this vaiue 
into the Memory Address Register (MAR) for reading instruc- 
tions/data from main memory. The PCU is also used to update the 
stack pointer and compare this value to the stack limits during 
stack operations. As can be seenin Figure 5, the Computer Block 
Diagram, data can be sent to the PCU from the ALU via the 
Transfer Register. The PCU can also output data onto the PCU 
bus to the Y-bus of the ALU via the bi-directional PCU transfer 
drivers. ; 


REGISTER INDEXED STORAGE 


REGISTER STORAGE IMMEQHATE 


The PCU is organized around four Am2901's. The use of 
Am2901's allow the PCU to generate addresses with the flexibility 
of an ALU chip, to increment the Program Counter by two in one 
microcycie, and to provide the stack pointer registers tor in main 
memory stack operations. The registers of these Am2901's are 
defined as shown in Figure 6. Register 0 holds the program 
counter and Registers 4 and 5 hold constants for incrementing. 
Byte addressing requires the address to be incremented by two 
every time 16 bits of instruction data are fetched. 


The Arithmetic and Logie Unit (ALU) 


The ALU shown in Figure 7 is organized around four Am2903's. 
The Am2903 performs all of the functions performed by the 
Am2901A but also provides the computer with separate DA bus 
and DB bus input ports as well as additional instructions to im- 
plement multiplication and division. Three major buses connect to 
the ALU: DA, OB and Y buses. The memory data from the Zp 
Register and microcode immediates are brought into the Am2903 
through the DA port while Program Status Bils 16-23 enter viathe 
OB port. The Am2903's output or receive data on the Y bus '2r 
loading into the RAM registers. The Am2903's zero decode Ivy. 
detects zero on the Y port whether or not the Y portis receiving 1 
sending data. 

To implement the defined instruction set, the RAM registe: sete 
tion controls are sent from the instruction ql) Register le iit. 
Am2903's. {9.3 (used with instructions with the Ry oa: X_ te 
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Not used — available 


Not used - available 


Not used (wired disable) 


Figure 6. PCU Register Assignments. 


connected to the A address inputs on the Am2903 while I,.7 are 
connected to the B address inputs. The ALU operations per- 
formed are controlled by microcode bits Mzg.gg which are con- 
Nected to the Am2903 Ip.g inputs. 


The Am2904 provides the microcode and machine status reg- 
isters holding the carry, negative, zero and overflow status. The 
machine status bits C, N, Z and OVR are defined as PSW bits 
16-23. Logic in the Am2904 includes a condition code multiplexer 
to select the true or complement of any of the four status bits and 
combinabons of status bits from either the machine or micro- 
Status registers or directly from the ALU. This condition code 
multiplexer is controlled by Instruction Register bits Iq.7 which are 
gated to the Am2904 ly 5 inputs dunng the execution of a condi- 
tional branch. The output of the multiplexer, labeled TEST is 
routed to the test tree for input into the Am2910. The Am2904 also 
provides the shift linkages and shitt linkage control and selection 
ol the type of carry signal to the ALU and lookahead carry unit. 


The ALU is designed to work with byte operations as well as 16-bit 
operations. Byte operations operate only on the lower 8 bils of 
register data without affecting the upper 8 bits of data. During byte 
Operations the WORD signal (Mop) goes inactive disabling the 
Write Enable and Output Y Enable for ALU bil slices 3 and 4. The 
word: byle multiplexer circuit will select C, N and OVR status bits 
from ALU bit sice 2 and al the same time ALU bit slice 2 has its 


MSS input pulled LOW to indicate most significant slice. The . 


zero status bit being OR tied to all of the ALU bit slices cannot be 
muluplexed. instead the Y bus signals 8-15 are forced to zero by 
gating zeroes Irom the PCU resulting in the Z signal line state 
being a function of ALU bit slices 1 and 2 only. 


The Computer Control Unit 


The Computer Control Unit controls the sequence of execution of 
the microinstructions. The Am2910 Microprogram Controller pro- 
vides the sequencer for the microprogram (see logic diagrams 
Sheet 5). Branch addresses and counter values loaded into the 
Am2910 Dg.11 inputs, onginate from the Pipeline Register 
(Mo.1}). the interrupt vector decoder, and the machine instruction 
decoder. The instruction decoder. also called Mapping ROM, (a 
512 x 8 PRGA) uses the Instruction Register Ig. as address bits. 
with the PROM outputs being the starling address of the micro- 
code sequence that executes each machine instruction. fn this 
design the Am29775 Registered PROM's are used to provide 
both the microprogram memory (512 x 96 bits wide) and the 
Pipeline Register. The microcode bits Mig.29 are output from 
Am29774 because these signals require open collector outputs 
rather than the standard tn-state outputs to allow the Am2910 
inputs Ig.3 to be pi‘"1d to zero 


The starting address generation for the interrupt service routine 
and initialization routine is accomplished with a minimum of extra 
logic. During the last microcode cycle of the previous machine 
instruction, the MAPEN signal is activated to enable the output of 
the Mapping ROM. However, if an interrupt request is pending, 
the Mapping ROM is disabled and the puil-up resistors force the 
eight least significant microprogram branch address lines to all 
ones, vectoring the microprogram to the Interrupt service 
routine. After a reset, the microprogram should be vectored to 
address zero, the starting address of the initialization routine. 
This is accomplished by having the reset signa! force zeroes Into 
the Am2910 lo. inputs which causes the Am2910 to output ad- 
dress zero. 


Clock and Memory Control 


The architecture of this computer achieves its high throughput by 
being able to execute machine instructions in as litte as one 
microcycie, This is accomplished by overlapping (also called 
pipelining) the fetch and decode with the execute microcycies. An 
essential part of this design is the memory control section. The 
clock and memory control circuits shown in Sheet 6 of the logic 
diagrams work together to provide a very efficient mechanism tor 
integrating memory operations with the computer. The memory 
interface timing is a clocked handshaked protocol shown in Fig- 
ure 8. Each memory transfer consists of a Bus Request, Bus 
Acknowledge response, Memory Request, Address Accept re- 
sponse, Data Request and a Data Sync response. At the 
maximum rate a memory interface response can occur 50ns after 
the computer activates a controf line. This makes It possible to 
tead from main memory once every microcycle (4 x 5Ons = 
200ns); however should a particular memory board require a 
longer cycie, it can delay sending Data Sync to the computer to 
extend the cycle. 


The read and write timing are shown in more detail in Figures 9 
and 10. Note that if a memory read is taking place during microcy- 
cle N, the Bus Request, Bus Acknowledge and the start ol 
memory address are output from the computer in the previous 
N-1 cycle, and the dala is sent to the computer during the first 
half of the following N+1 cycle. Now consider the case of 
back-to-back main memory read cycles. in this case, in the mi- 
crocycle that the computer sends the address to the memory 
board, the memory board is sending data to the computer; but this 
is not the data associated with the address being received but the 
data associated with the address received during the previous 
microcycie. 

A free running or uncontrolled 20MHz clock on the backplane is 
connected to all of the devices which elfect memory transfers 
(CPU, bus controller, and memory modules). All of the signal 
handshaking that is required by the memory interface protocol is 
clocked with the same 20MHz clock to ensure no metastable 
conditions occur during memory transfer. Careful examination of 
this memory interface operation will reveal that not only does tt 
solve the very serious metastable problem, but also that the clock 
synchronization and bus propagation delay occur during the 
memory read access time (or wrile time) and do not siow down 
thea memory transfer rate. r 


The CPU clock generation Is intimately related to the Memory 
Control Logic. The CPU clock signals Phase-1 (d,) and Phase 2 
(2) are shown along with he memory interface signats in Figure 
8. Phase 1 is a square wave set high at the beginning of the 
microcycle and has a periad of 200ns. Almost all operations of the 
computer are clocked with the leading edge of #,. The clock 
control logic will enable the next cycle only ifa Bus Request has 
teceived a Bus Acknowledge and only if a Memory Request has 
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Figure 8. Clocked Handshaked Protocol. 


received a Data Sync response. If the bus or memory resources 
of the system are temporarily being used by other processors, the 
computer will stop the clock and wait. 


Data Path 


The Data Path logic incorporates 8-bit wide devices wherever - 
possible. The D Register drives directly onto the external data 
bus. Both main memory and VO data are received through the Z ° 
Registers. Registers Z, Zo and Z, are actually latches im- 
plemented with Am74S373's. The Z Register enable latch signal, 
LDZ is derived from the memory control logic and main memory 
board fogic both of which are clocked with the uncontrolled 
20MHz clock (2OMHZUNC). Using the uncontrolled clock allows 
the memory operation to go to completion at memory speed even 
when single stepping the microcode. This allows the system to 
use dynamic RAM's in the main memory since stopping the 
handshaking circuits during single step would prevent refresh — 
Operations from taking place. rs 


Data from the main memory passes through the Z Register to the 
Zq and Z, Registers. The Zp and Z, Registers are enabled trans- 
parent ai the beginning of the microcycie following the read main 
memory microcycle. This allows memory data to flow through the 
Z and Zy Registers (actually latches) to the ALU or flow through 
the Z and Z, Registers to the Instruction’ Decoder (Mapping 
ROM). The Z, and Zp Registers are ncked down halfway through 
the microcycle guaranteeing the computer solid data and making 
it possible to send dala from the D-Register out to the extemal 
Data Bus during the second half of he same microcycie. This is 
another example of how this design tightly dovetails data trans- 
fers in order to gain very high execution rates. 


Interrupt and Input/Output 


The interrupt and UO section is shown in Sheet 7 of the logic 
diagrams. 


The basic Interrupt handling is controlled by the Am2914. In this 
design the Am2914 is used to prioritize and enable interrupts, 
provide the mask register, generate an Interrupt Request and 
Interrupt Vector. Interrupt nesting is done in the machine software 
interrupt handler. The extemal interrupt request signals (INTo- 
INT) are input into the Am2914 from the extemal Control Bus (C 
Bus). When a peripheral controller requests computer servicing, 
it activates its assigned Interrupt line. If this interrupt level is 
unmasked and interrupts are enabled, the Am2914 activates the 
INTERRUPT REQ signa! that goes to the Computer Control Unit 
which causes the microprogram to vector to thé microcode inter- 
Tupt service routine. This microcode routine pushes the PSW 
onto the main memory stack, then reads the interrupt vector from 
the Am2914 and uses this value to vector the computer to the 
machine software routine that services the interrupt. 


The Am9519 MOS Universal Interrupt Controller is Incorporated 
into the design and its Group Interrupt signal is connected to the 
least significant INT input of the Am2914. The Am9519 handles 
an additional eight interrupt leveis-for low speed requesting de- 
vices. This MOS LS! component offers the computer comprehen- 
Sive interrupt handling capabilities at low cost. One feature the 
Am9519 offers Is the capability of software generated interrupts. 
The console function, single instruction stepping, is implemented 
using a microcode routine that uses the software generated inter- 
tupt capability. 
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Figure 9. CPU Read Timing. 
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Figure 10. CPU Write Timing. 


9ZE 


The VO protocol tor the AMD 16-Bit Computer is similar to that 
required to control AmM8080/9080 peripheral circuits. As shown in 
Figures 11 and 12, the computer outputs the address over the 
System address bus, activates a control line (e.g., ORD) and 
holds these outputs until receiving a response, IOACK, from the 
Penpheral controller. Execution of the I/O operation is done al- 
most entirely in microcode with the VO Control Register, a single 
Am2920, being the only additional hardware required. This is an 
example of a design precept followed in this computer which is to 
plement all features in microcode wherever possible. This re- 
Sults in a low cost computer, although sometimes slower, and a 
Gesign that is flexible and easily moditiable to meet new 
fequiraments. 


The UO section has two Am8251/9551 Programmable Communi- 
Caton Interface components giving the computer two serial ¥O 
Ports, one of which is reserved for the console. The console can 
be any standard RS-232 interface terminal. ‘ 


instruction Execution 


To-execute instructions, the main steps performed by the com- 
puter are: (1) form memory address, (2) instruction fetch, (3) 
decode, (4) displacement fetch, (5) form operand address, (6) 
operand fetch, and (7) execute. Every instruction type is made up 
of Microinstructions that execute these basic steps, but most 
structions require three steps or less. Instruction sequences for 
Register to Register (RR) and Register to Indexed Storage (RX) 
structions are shown in Figures 13 and 14 to illustrate how the 
Computer operates. These figures show the AR instruction re- 
Quinng four microcycies and the typical RX instruction requiring 


Cr 
REO BUS (CPU) 
LOAD MAR 


20aD D AEG 


seven microcycles. However, as will be explained later, in actual 
operation the effective time for an RR instruction is one microcy- 
cle and three for the AX. 


Form Instruction Address 

During this microcycle the instruction address is formed by having 
the Program Control Unit (PCU) under control of the micropro- 
gram incremeni the Program Counter by two. This address is 
then loaded into the MAR and back into the PC. 

At the beginning of the cycle, Bus Request is activated causing 
the Bus Controller to respond with Bus Acknowledge. The ad- 
dress is then output from the MAR out 0”: the Address Bus 50ns 
prior to the beginning of the next cycle. 


Instruction Fetch 


During this cycle, the main memory is fetching the contents of the 
address previously generated. The computer is designed to work 
with high-speed main memory capable of reading a memory 
location in one microcycle so that the instruction will be sent back 
to the computer at the beginning of the next cycle. 


Decode Cycle 
The instruction fetched from main memory during the previous 


* cycle is sent to the computer at the beginning of the cycle. The 


instruction falls through the Z and Z, Registers (actually trans- 
parent latches) andis routed to the Instruction Decoder (Mapping 
PROM). The Instruction Decoder translates the 8-bit operation 
code of the instruction into an 8-bit address used as the starting 
address for the microprogram that will execute this instruction. 
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Figure 11. ¥O Read Timing. 
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Microinstruction 

Operation 
Form Instruction Address 
Instruction Fetch 
Decode 
Displacement Fetch 
Form Operand Address 
Operand Fetch 
Execute 


Form Instruction Address 
Instruction Fetch 
Decode 


Displacement Fetch 
Form Operand Address 
Operand Fetch 
Execute 


Figure 12. /O Write Timing. 


Microcycle Time 


Figure 14. RX Instruction Sequence. 


Displacement Fetch Cycie 


After every instruction fetch another read cycle takes place The 
second memory read will be another instruction fetch or an 
operand displacement fetch. The computer does not know what 
kind of a read out it is until the instruction decode 1s finished. For 
an RX instruction, after the memory read is completed, the com- 
puter identifies i as a displacement. 


Form Operand Address Cycle ia 


The memory word is sent from the main memory at the beginning 
of this cycle and then passes through the Z and Zp Register and 
goes to the ALU (Am2903’s). The ALU adds the displacement 
and the contents of the register specified by X> field in the opcode 
and forms an operand address which is then loaded into the MAR. 
This has to be completed 50ris before the end of the cycle. 


Operand Fetch Cycle 


The memory read cycle is performed and the operand is sent to 
the computer at the beginning of the next cycle. 


Execute Cycles 


As the name implies, these are the microcycles that perform the 
task of the instruction but with the Am2903's normally only one 
execute cycle is required; however, some instructions (e.g.. 10 
instructions) take as many as seven execute cycles. 


Simultaneously with the last execute cycle the Instruction De- 
coder is enabled. 


Pipelined Operations 


Hthe architecture of the computer executed each of the instruc- 
tons and each microstep sequentially, this computer would be 
fst another computer relying on a high-speed clock to gain high 
throughput. However, the 16-Bit Computer becomes an excep- 
tonal machine by using pipelining techniques. In this approach, 
he instruction steps for the following instructions are done during 
the decode and execute steps of the current instruction. The 
PPelining operation for a Register to Register class of instructions 
5s shown in Figure 15. With the pipeline full, note that when 
Mstruction A is being executed, instruction B is being decoded, 
mstruction C is being fetched from Main Memory and the MAR is 
being loaded with the address for instruction D. In the following 
¢ycle, RA instruction B is executed and RR instructions C, D and 
E proceed through the pipeline. The pipelining technique results 
man RR instruction effectively being executed in one microcycle. 
As illustrated in Figure 16, anew Rx instruction can be executed 
every three microcycles. 


Pipelining is great for throughput, but it is a bear to microcode 
@specially the first time through since during any one cycle up to 
four instruction sequences have to be considered. It is not as bad 
as it first appears. Note that an instruction decode cannot lake 
P'ace until the last execute cycle of the current instruction. The 
Major pipelining takes place during the first three steps: form 
memory address, instruction fetcti, and decode. Execute and 
operand fetsh steps allow full overlapped operation only during 
the last execute cycle. Instructions that require many execute 
mMucrocycies (e.g., (/O instructions) cause the compuler pertor- 
mance to drop down to nearly that of a non-pipelined machine. 


Pipeline Operation with Regard to Branching’ 
and Interrupts 


Pipeline operations greatly reduce instruction execution time if 
machine instructions are executed in sequential order; how- 
ever, if a branch is taken this advantage is lost because the 
steps set up in preparation for a decode cycle become use- 
Jess. The pipeline is said to be “flushed out" when a branch ts 
taken. The RX Branch on Condition instruction has the form: 


Where: M is a 4-bit field specitying the conditions for the 


jump. 
(Xz) + displacement is the branch address 


Figure 17 shows the sequence chart for a RX Branch on 
Condition instruction. During the microcycle A, the target ad- 
dress K for the branch is formed and loaded into the MAR 
and also the instruction B is fetched for the no branch case. 
By microcycle A, it has been determined to take or not take 
the branch. If the branch is not taken, the MAR is loaded with 
address B+2, while if the branch is taken, an instruction fetch 
is performed for K and the MAR is foaded with K+2. Finally in 
Ag the next instruction is decoded. By proper microcoding, the 
conditional branch is executed in only three microsteps even 
though the pipeline was “flushed out”. 


A, B, C, D are RR instructions 


Figure 16. Register-to-Indexed Storage Plpcline Operation. 
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A = RX Branch Instruction 
= Next RX Instruction if branch is not taken 
= next RX tnstruction if branch is taken 


Figure 17. Branch on Condition RX Pipeline Operation. 


As with branching, an interrupt response alters the sequence 
of execution and “flushes” the pipeline. As was discussed 
previously in the Interrupt arid Input/Output section, an inter- 
tupt request blocks the decoding of the next machine instruc- 
tion and causes the Computer Control Unit to vector to the 
interrupt service rouline. This microcode service routine 
pushes the PSW consisting of flags and Program Counter 
(PC) value onto the stack. The PC value is the current PC 
value minus 4. It is necessary to back the PC up to two in- 
Struction words (4 bytes), because the fetch instruction and 
form instruction address steps in the pipeline at the time of 
the jump to the interrupt microcode sequence have to be re- 
peaied when returning to the main machine program. 


MICROINSTRUCTION FORMAT 


All operations of the AMD 16-Bit Computer are under control 
of the microinstruction. Each microinstruction is 96 bits in 
length. The micromstruction forrnat is summarized in Figure 
18. The microinstruction definition is summarized in Figures 
19a and 19b and is Gdetailed in Table 2. 


Figure 20 illustrates the AMDASM” Definition file for the 16-Bit 
Computer. AMDASM’ ts a mel2-assembler developed by AMD 


for writing microprogams. The definition file defines microword 
length (WORD statement), formats (DEF statements) and con- 
stants (EQU statements) for the use of the actual microprogram 
(Figure 31). 


The definition file is divided into 8 parts: 


1. Am2910 sequencer opcode definitions 
2._Am2903 ALU opcode definitions 

. AM2901A PCU opcode definitions 

. Am2904 shift mux and status control definitions 
. Datapath control bits definitions. 

Memory control bits definitions 

. Control strobe and control bits definitions 

. Immediate operand field definition 


ONANAL 


Am2910 Sequencer 


Bit 91 of the microword is the inpulof CCEN of the Am2910. When 
bit 91 is a logical 1, the conditional operations are forced to 
unconditional operations. Bits 19-16 are the input to the instruc- 
tion inputs to the Am2910. Bits 11-0 are the jump address field tor 
instructions that need an address operand. 


“DATA MEMORY 
| awon | PATH (16) PROGRAM CONTROL (11) CONTROL (5) 


CONTROL CONTROL SEQUENCE NEXT MICRO 
STROBES (6) BITS (8) CONTROL (5) ADORS/AMMEDIATE (16) 


Figure 18. Summary of Microinstruction Word Fields. 
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ROUTE TOS ATB 


TRANSFER Z TO ZI (BP) Z— ZI 
Am2910 


Am2903 IEU WORD BYTE 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 
Am2903 


ENABLE TRANSFER REG. 
LOAD TRANSFER REG. 
I-REG EN CTR 

I-REG INC/DEC 

PCU TRANS CHIP DISABLE 
PCU TRANSFER REG 

LOAD MEMORY ADDR. REG. 
LOAD D-REG. 

LOAD 21 INTO | REG. 
ENABLE ZO -- DA 

ENABLE PSW 

SHIFT CNT Am2910 ADOR. 
BRANCH INSTR. EN 
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Am2901 F —8-O 
Am2901 . 
Am2901 
Amz2901 
Amz2301 
Am2901 
Amz2901 
Am2901 
Am2901 
Am2901 
Am2901 


vSSS9S2S8S6S 1929L9b8 


BUS REQUEST 
MEMORY REQUEST 
HOLD REQUEST 
MEMORY WRITE/READ 
MEMORY WORD/BYTE 


Figure 19a. Micro Control Word Bit Definitions. . 


EN IMMEDIATE — DA BUS 
ROM IREGEN 

tO CONTROL REG EN 
Am2914 INTERRUPTS DISABLE 
Am2914 ENlo-ENIy 

Am2904 SHIFT EN 
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GENERAL 
USE 
CONTROL 
BITS 
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Am2904 EN ZERO 

Am2904 EN CARRY 

Am2904 EN SIGN 
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Figure 19a. Micro ontrol Word Bit Definitions (Cont.) 
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ROM/MREGEN | ¥O Control Register | Am2914 Ip-4, | Am2904 Shift Enable 
Bit 46 


Figure 19b. Detailed Description of Sits 34 through 47. 


Table 2, Microinstruction Definition. 


Definition 


Routes second register field to B-RAM of Am2S03. 
Loads the value ir: the Z reg.ster into the Z, Register at the beginning of the raeoorte 


Enabies the CC input of the Am2910. 


These bits control the four Am2903's. The function of EA, OEY, OEB, and fg. is listed in 
Figure 20 WORD when enabled (LOW) causes the Am2903 s 12 operate on words (16-bits). 
When disabled (HIGH) the ALU operates on bytes (the least significant byte). Th:s bit disabled 
biocks WE to the upper two Am2S03's and tums off their Y outputs. 


Zeroes should be forced to the upoer 8 bits af the Y bus via the PCU to allow the zero status 
to operate correctly when the WORD bit ts disabled. Also, when disab'ed the status (C. OVR, S) 
sert lo the Am2904 is taken from the second Am2903 (numbenng 0-3 least significant to 

most significant slice) instead of the most significant Am2903. 


Enable Transfer Register - enables the Transfer Register onto the DA input bus of the 
Am2901A's and Am2903's. 
Load Transfer Register — loads the Transter Register from the Y bus. 


Enable | Register Counter - enables the I Register Counter (I7.14) to count. This value 1s 
used to address the general registers during slack instructions and by incrementing or 
decremenung this value the microprogram can read or write successive registers. 

{ Register INC/DEC — the value in ly.,4 can be either incremented (if this bit rs HIGH) or 
decremented. 

PCU Transceiver Disable — when HIGH this bit disables the PCU Transceivers from receiving 
or transmitting data. 

PCU Transceiver Control — when HIGH this bit allows the PCU Transceivers to pass data 
from PCU to the Y bus. | WORD high (microbit 90) disables the least significant 8 bits of 
these transceivers.] When LOW data passes from the Y bus to the MAR. 

Load Memory Address Register (MAR) — this bit loads the Memory Address Register. 

Load D Register - this bil loads the D Register with data from the Y bus. 

Load Z, into ! Register - this bit loads data from Z, into the | Register. The | Register holds 
only the upper 16 bits of the instruction. 

Enable Z) — DA — this bit LOW enables the Zp Register onto the ALU DA. 
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SHTCNTEN 


. 


Table 2. Microinstruction Definition. (Cont.) 


Enable PSW - this bit LOW enabies the PSW onto the ALU DA. 


Shift Count to Am2910 - this bit LOW enables the least significant four bits of the instruction 
(Ip.3) Onto the D input to the Am2910 sequencer. This allows the value to be entered into the 
Am2910 intemal counter to be used during shift instructions. 


Branch Instruction Enable — this bit LOW enables I4.7 of the Instruction Register onto the 
Am2904 |p.3 input. The Ip.3 inputs control the tests of the status register. 


These bits contro! the PCU which is designed around four Am2901's. The PCUl,, PCUI3, 
PCUlz, PCUI, and PCUl, bits connect directly to the Am2901 ly, 5. Iz, |) and Ip respectively. 
The PCUA2-PCUA, and PCUB2-PCUBy connect to the A and B Address inputs olf the Am2901. 
tg, Is, Ig, Ag and By are tied to ground. I is tiedtol, ~ 


Request Bus - this bit requests use of the system bus. This request is made the microcycle 
preceding a Memory Request or use of the bus for an V/O transfer. If the request is not honored, 
the processing of the next microinstruction is halted until the acknowledge is issued. 


Memory Request — this bit requests the memory to do a read or write operation. 


Hoid Request - this bit LOW blocks the bus controller from releasing the system bus to 
another device. Normally a Bus Request is cleared as soon as the Bus Acknowledge is issued. 
HREQ holds Bus Request and prevents any other device from using the bus. 


Memory Write/READ - this bit indicates to the memory the MREQ is for a write operation 
(if HIGH) and a read operation (if LOW). 


Memory Word/BYTE — the Memory Word/BYTE microbit specifies whether the memory 
operation will be a word operation or a byte operation. If the operation specified is a byte 
operation the least significant address bit determines which byte of the two byte pair in memory 
is affected. If the LSBit is a zero, the most significant byte is read or written, and the LSBit is a 
one, the least significant byte is read or written. 


EN Immediate DA Bus — this bit LOW enables the 16-bit immediate value (least significant 
16 bits of the microinstruction) to the ALU DA bus. 


ROM/I REG Enable — this bit enables either the ROM bits 42-35 or the | register bits lp.7 onto 
the A/B address inputs of the ALU according to the following: 


| Boa | 4o3 MPA-695 


/O Control Register Enable — this bit loads the lO Control Register with microbits 42-35. 


Am2914 Interrupt Disable — this bit disables the Am2914 Interrupt Controller from recognizing 
interrupt requests. 


Am2914 ENlg-ENly — this bit is the instruction enable for the Am2914. The instruction Inputs Ip.5 
are connected to microbits 35-38 respectively. 


Am2904 Shift Enable — this bit is connected to the shift enable of the Am2904. The shift 
controls ig.49 are connected to microbits 35-39 respectively. 


Table 2. Microinstruction Definition. (Cont.) 


This contro! field is used to provide several different functions as defined by the previously 
described control strobes (microbits 47-43). 


OUT EN CONDITIONAL TEST 
EN ZERO . 

EN CARRY 

EN SIGN 

EN OVERFLOW 

EN MACRO STATUS 

EN MICRO STATUS 

CARRY OUT CONTROL 
CARRY OUT CONTROL 


These bits are used to contro! the Am2904. Their functions 
are defined in Figure 21. OECT is used to enable the test 
output of the Am2904 to the CC input of the Am2910. 


These bits determine which test is to be performed for the conditional branch and stack 
functions. The various tests are listed in Figure 25. The testing is done both in the Am2904 


and an 8 to 1 multiplexer. 


291013 
291012 
291011 
291010 


OeaxNHNAODN A 


Am2903 ALU 


The first 16 equates assign mnemonics for the 18-I5 of the 
Am2903 which controls the destination of the ALU result. The 
next 16 equates assign mnemonics for [4-11 of the Am2903 
which control the operations of the ALU. The ALU definition 
indicates the default is the Y bus forced to zero with no operation 
On destination. The next group of definition selects the source 
Operand, followed by the special function definitions of the 
Am2903. 


Am2901A PCU 


The PCU definitions include a group of often used PC instructions 
Such as PCU. NEXT, PCU. JUMP etc. The PCU definition itself 


These bits are connected to the I3.9 inputs of the Am2910 to ‘control 
the sequencing of the microprogram. Their definitions are listed in Figure 26. 


These bits provide the branch address for the Am2910 and the 16-bit immediate field. 


allows a not predefined instruction be accessible to the micro- 
programmer. - 


AM2904 Shift Linkage Multiplexer and Status Register 


The group of equates control the updating of the status register 
and the TEST definition controls the shift hnkage multiplexer. The 
carry control controls the carry into the least significant Am2903 
slice. 


Datapath Control 


The data control equales assign mnemonics to different datapath 
control bits. 
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Figure 20. Definition File for 16-Bit Computer. 


ANDOS/29 AMDASM MICRO ASSAMBLER, V1.2 
BEFIMITION FILE FOR 16 DIT COMFUTER 


DEF Bea, 95k sROUTE Ri TO BAN BD 
EQUATES FOR DATAPATH DEFINITICN 


32 REC TO 21 AEG 

SEMAPLE THANSFER BESISTER 
FLOAD TSANSTER REGISTER 
GI-BEC Em CTR 

31-AEC Incsorce 

sPCU TRANSCEIVER TO 1-BUS 
SPCU TRAMSCEIVER [TO MAP BUS 
37CU TRANSCEIVER CEIP DISABLE 
SLOAD mae 

sLOaD D REG 

SLCAD ZI ImtO I-BES 
TENABLE 2@ TO DA 

SENADLE PSb 

sSEITT Cur 291€ aDDe 
SSRAMCH INSTRUCTION ENABLE 


DATAPATE DEFIMITION 


t 
DATAPATE: DEF SL, 1VB0C, 142, 1VB01, LVRCO, LVBS1,1V309,ZVR9I2 LTRS, 
4 LVDOG,AVBOL,LVBSL AVIS, L7POL LIBEL, 55K 

EQUATES FOR MEMORT COnTROL 


3 
Bt) Ty Qu Ber s3US REQUEST 


337 


SMENORT PEGCLST 
SHOLD MEQUIST 
Ere uyTr 


sMENGAT WoaD/DItEe 


TGC Bei 


DEFIngiom FOR marorT CowraoL 


‘ 

AIM .COMTIDEF 422, 10D00, 10208, 10701 ,1UDed 1 TRed, 49% 
‘ 

+ EQUATES FOR COmTROL STROBES 


SRGM/DeEC TAS 

$1/0 CCeTROL EEC. EMASLE 
SUMTIBAIPT DISADLE 
SEMAILE 20-13 CH APIIIG 


aor: gt M1 
1Oxm: = fQu 368 
Ierilss EGU see 
IsTALBartQu Bee 
CONTROL STACHE DUFJaTICS 

DEF 481, 10dee, 17901, 10301 Lede sad 
CoamoL BITS COvinizice 
DKF 532, 61B8,35I 
IMAELISTE POM BEV LEITION 
DAY @73, 208,522, 160Pe 
tea 


TOTAL PEASE 1 ERROAS = =O 


goo 
3 
= 


vl 


AEs SEMADLE IMPALTSTE OPERASD 


Figure 20. Definition File for 16-Bit Computer (Cont). 


Memory Control 


The memory contro! equates assign mnemonics to different 
Memory control bits. 


Control Strobe and Control Bits 


The control strobe equates assign mnemonics to the control bit 
strobe signals. The control bit definition defines a hexadecimal bit 
Patiem for the 8 control bits. 


Immediate Operand 


When the Am2910 sequencer is executing an instruction which 
does not require an address operand, bits 15-0 in the microword 
can be used as a 16-bit constant to load ALU, PCU etc. This is 
accomplished by putting the constant in bits 15-0 and force bil 48 
Wo logic 0. 


MICROCODE 
Flowcharts 


The flowcharts of the major instruction types are shown in the 
following figures. 


Figure 21 illustrates the basic microprogram flowchart and dem- 
onstrates how the pipelining is done in microcode. This figure 
illustrates the sequencing of the computer starting with no in- 
structions in the pipeline. By the fourth microinstruction, the 
Pipeline is full and the CPU can execute for example a mac- 
toinstruction every microcycie. 


Figure 22 illustrates the execution of an RR instruction. During an 
RR instruction, PC-+ 6 is loaded into the MAR and a bus requestis 
issured for the content of PC+6. The contents of PC+4 are read 
into the Z register. The Z ‘rd | Registers are loaded with the 
contents of PC +2. The ins’: u. - nat PC is executed. The input to 
the mapping PROM is loadev’ »th the contents of PC+2. Thus in 
& stream of RR instructions, four Instructions are in progress 
concurrently, 


Figure 23 illustrates the execution of an RX instruction. In this 
ligure the decode operation takes the microprogram to the mi- 
crostep where the form address operation is done. Since the 
decode of the instruction has been completed in the previous 
step, the form address microinstructions are unique to each RX 
instruction in spite of the fact the operation performed is identical. 


LOADS PC INTO MAR. 
REQUESTS BUS FOR INST H. 


LOADS PC +2 INTO MAR AND PC 
REQUESTS BUS FOR ST Not 
READS INST NL 


REQB, KREO 
PC +2-— MAR, PC 


LOADS PC+? INTO MAR ANO PC 
REQUESTS BUS FOR INST No2 


cbc READS INST No} 


PC +2 -— MAR, PC 
Ze zZ,.%— I 
OECODE 


LOADS Z, REG WITH INST N 
LOADS | REG WITH INST N; 
DECODES INST WN. 


Figure 21. Microprogram Start Up Flow Chart. 


From the form address step, the microprogram jumps to 
FETCHOP where the operand is fetched. This step retums to 
where the instruction is actually executed. 


Figure 24 illustrates the execution of an RSI instruction. Atthe first 
microstep, the immediate operand is already in the Zp regtster. So 
the instruction is executed in the first step. The microprogram is 
then jumped to START2 to refill the pipeline. 
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FAOM NST N-1 


REGS, MREO 
PCo2-— MAR, PC 
I~ %.%,— I 
EXECUTE 
DECOOE 


HANOLER 


OPCODE 


Zo*X>—— MAR 


To 
WST Nel 


REQUESTS OUS FOR INST Nod 

REAOS INST No? 

PUTS PC ¢2 INTO PC AND MAR 

LOAOS Z, ANO I WITH INST Not 
EXECUTES INST N 

DECOOES INST Nol 


RA INSTRUCTIONS IMPLEMENTED 


Rt, R2 

Load Register 

Add Register 

Subtract Register 

AND Register 

OR Register 

Compare Logical Register 
Exclusive OR Register 


Rt = (R2) 

Ri = (R1) + (R2), Set CC 
R1 = (A1) — (R2). Set CC 
R1 = (R1) AND (R2), Set CC 
Ri = (A1) OR (R2), Set CC 
Set CC with (R1) - (R2) 

R1 = (R1) XOR (A2), Set CC 


Figure 22. AR Instruction Flow Chart. 


REQB. MREO 


JSB FETCHOP READS INST Net 


REOB, MREO 


REOB, MREO 


FORMS OPERAND ADORESS BY ADOING Z5+X2 
REQUESTS DUS FOR OPERAND 


LOADS PC+2 INTO PC. MAR 
REGUESTS BUS FOR INST N+2 
REAOS OPERAND 

LOAOS INST N+1 INTO Z, 


LOADS PC +2 INTO PC, MAR 


REQUESTS BUS FOR INST Ne}. 


REAOS INST Ne2 


LOADS Z, WITH Nt 


DECODES INST Not 


EXECUTES INST N 


RX INSTRUCTIONS IMPLEMENTED 


Al, X2 (DISP) 
Al, X2 (D) 
Ail, X2 (D) 
Ri, X2 (D) 
Ri, X2 (D) 
Ri1, X2 (D) 
Ri, X2 (D) 
Ril, X2 (D) 


Al = (x2) + D 

(X2) + O = (Al) 

Al = (R1) + 1(X2) + OD}, Set CC 
A1 = (R1) - {(X2) + D}, Set CC 
Ri - (1) AND [(X2) + 0), Set CC 
R1 = (R1) OR [(X2) + D), Set CC 
Set CC FOR (Al) — {(X2) + D} 


Figure 23. RX Type Instruction. : 


REOB. MREO 
PC +2 -—— MAR, PC 
EXECUTE 
JMP START2 


REOB. MAEO 
PC +2-— MAR. PC 
Z—2,.%4,—1 
OECODE 


To 


INTERRUPT 
HANDLER 


To 
INST Net 


339 


LOADS PC +2 INTO MAA PC 

REQUESTS BUS FOR INST Nod 

READS INST Not 

EXECUTES INST N WITH IMMEDLATE OPERAND IN Z, 


JUMPS TO START2Z 


LOAD PC-2 INTO MAR, PC 
REQUESTS BUS FOR INST Wed 
READS INST Ne? 

LOADS INST Net INTO 2, 
LOAOS IREG © 

OECODES FOR INST Net 


IMMEDIATE INSTRUCTIONS IMPLEMENTED 


OP CODE R1, DATA 
Load Immediate 
AND Immediate 


OR Immediate 


Exclusive or immediate 


Add Immediate 
Subtract Immediate 


Compare Immediate 


Rt = DATA 

R1 = (R1) AND DATA, Set CC 
R1 = (R1) OR DATA, Set CC 
Ri = (R1) XOR DATA, Set CC 
R1 = (R1) + DATA, Set CC 
Rt = (R1) — DATA, Set CC 
Set CC with (R1) - DATA 


Figure 24. immediate Instructions. 


Figure 25 illustrates the execution of an unconditional branch 
instruction. At the first microstep the displacement is already in 
the Zp register. The branch address is formed by adding the 
contents of the Zp register fo the contents of the index register X,. 
The MAR is loaded with the branch address and a bus request is 
issued for the contents of the branch address. The branch ad- 
Gress is also loaded into the transfer register for subsequent 
loading of PC. In the next step, the contents of the transfer 
fegister+2 is loaded into the PC and MAR. A bus request is 
issued to BA+2. The content of BA is read. The microprogram is 
then transferred to START2 to fill up the pipeline. 


Figure 26 illustrates the Conditional Branch instruction. In step 1, 
unlike the Unconditional Branch instruction, the contents of the 
memory (instruction N+ 1) is read, in case the test condition fails 
and the macro program falls through. The condition test is en- 
abledin this step. If the test passes, the microprogram transfers to 
Unconditional Branch routine. If the test fails, the microprogram 
Proceeds lo fill the pipeline and continue. . 


Figure 27 illustrates the branch and link instruction. The flowchart 
is similar to Unconditional Branch except an extra step (STEP 2) 
ts inserted. This step saves PC in Ry. 


Figure 28 illustrates a shift or rotate instruction. In STEP 1 the 
' opcode of the next instruction is loaded into Z, registers and the 
: Shift count of the shift instruction is loaded Into the loop counter of 
«Am2910. STEP 2 executes the shift instruction N+ 1 times, where 
I Nis the shift count in the instruction. It should be noted that since 
4Am2910 detects — 1 as the stop condition, the shift count loaded 
¢ should ba ong less than the desired count. Step 3 is the same as 
Bthe ANI (request next instruction). It is duplicated because the fait 
econdition of RPCT in Am2910 can only fall through. 


LOADS MAR AND TREG WITH 
BRANCH ADDRESS 
AEQUESTS BUS FOR tNST AT BA 


REQUESTS BUS FOR INST BAc! 
READS INST BA 

LOADS MAR AND PC WITH 
ADORESS OF INST BA+1 


REQB, MREQ 
TREG+2 -— MAR 
TREG +2 —— PC 


REQUESTS GUS FOR INST GAc2 
READS INST BAol 

LOADS INST BA INTO Z, ANC! REG 
DECODES INST BA 


REQB,MREQ ~ 
PC+2-— MAR, PC 
Zr U.2,— I 
DECODE 


GA = BRANCH ADORESS 


BRANCH 


8X X;, (0) 


Figure 25. Unconditional Branch. 
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INSTN-1 


REQB, MREO ~ 
Zy*Xz— MAR 
ZX; — TREG 

BRIEN 


REQUESTS BUS FOR BRANCH ADOR 
REAOS INST Neo} 
LOADS MAR ANO TREG WITH BRANCH ADOR 


BRANCH 
CONDITION? 


REQUESTS BUS FOR BA+1 
READS INST AT BA 
LOADS PC, MAR WITH TREG+2 


REQUESTS BUS FOR INST N+2 
LOAOS PC AND MAR WITH PC +2 
LOADS Z, WITH INST Not 


REOB. MREQ 
TREG +2 — MAR, PC 


REQUESTS BUS FOR BA+2 
READS INST AT BA+1 

LOADS PC, MAR WITH PC +2 
LOADS Z,, IREG WITH INST AT BA 
DECODES FOR INST AT BA 


REQUESTS BUS FOR INST N¢3. 
READS INST N+2 

LOADS PC, MAR WITH PC+2 
LOADS Z, WITH INST Not 


REQB, MREG 
PC +2 -— MAR, PC 
Z—2,.%,—1 


REOB, MREQ 


DECOOES FOR INST Not 


DECODE 


TO INST AT BRANCH ADOR 


Figure 26. Conditional Branch. 


Figure 29 illustrates the input insfruction. In STEP 1, the I/O Port 
Address ts formed by adding Zp and X2. Bus request is issued for 
the UO Port. The desired width of the l/O read pulse is loaded into 
the Am2910 Loop Counter. The width of the VO read pulse is 
(N+2) X cycie time where N is the number loaded. The I/O read 
signal ts tumed on. In STEP 2, the bus is held tor the /O address 
and the loop counter is decremented until it becomes -1. In 
STEP 3, |'O read pulse is turned off but YO address is held for 
possible address hold time requirement of the /O device. On the 
trailing edge of the lO read pulse, the content of the I/O Port is 
Strobed into the Zp register. In STEP 4, the content of Zp register 
is loaded into RA;, thus completing the VO read. Bus request is 
issued for the next instruction and microprogram jumps to 
START71 to refili the pipeline. 


Figure 30 illustrates the output instruction. In STEP 1, bus request 
is ussued for the l'O Port Address. In STEP 1, the content of R, is 
transferred to the D register for outputting to the data bus. The /O 
wnite pulse is set and the width of the write pulse is loaded into the 
Am2910 Loop Counter as in the input instruction. In STEP 3, tho 
VO address is held untilloop counter becomes —1.In STEP 4, the 
content of the D register is strobed into the I/O Port by turning off 
the VO Write Pulse. The microprogram jumps to START to refill 
the pipeline. 

The Figures 21-30 illustrate the major instruction types im- 
piemented. These are by no means the only possible instructions 
for the 16-bit computer described. Some other instructions such 
as stack instructions are shown in the microcode but not in the 


figures and should be easily understood with the above examples 


as a guide. 


Figure 31 illustrates the implementation of some typical instruc- 
tions. Instruction 0 is the restart instruction. It jumps to INIT 
which is located in location H#180 because the mapping PROM 
maps only into the first 256 locations. So it is desirable to pre- 
serve these locations for Macro instructions. The initialization 
routine does the following: 


. Tum on I/O reset signal and jump (inst H#0) 
. Set Ro in ALU to O (Inst H#180) 

. Set Rg in PCU (PC) to 0 (Inst H#181) 

. Set R, in PCU (SP) to H#4000 (Inst H#182) 
. Set R, in PCU to 2 (Inst H#183) 

. Set Rs in PCU to 4 (Inst H#¥184) 

. Turn off VO reset signal (Inst H# 185) 

. Initialize consde USART (Inst H#186-H# 190) 


BNA nA AWN = 


The microinstruction that executes macroinstructions are 
grouped as follows: 


Type Figure Microlinst # 
(Hex) 

RR Instructions 22 005-008 
RX Instructions 23 00C-01B 
RSI Instructions 24 010-022 
Branch Instructions 25-27 023-02A 
Shift Instructions 28 028-042 
input Instruction 29 043-046 
Output Instruction 30 047-04A 
Stack Instructions = 04B-059 

OSA-061 


Interrupt Instructions = 


en, te ee 


. 


Zo+Xz—— MAR 
29¢X; = REG 


REQS, MAEG 
TREG+2—— PC 
TREG + 2+=-MAR 


‘REOS, MREQ 
PC+2— MAR, PC 
Z—7Z,.2:—1 
OECODE 


BAL Ry. X; (0) 


INSTN EXECUTE SHIFT 
OR ROTATE 


REOB, MREO 
PC +2 —— MAAR, PC 
z-! 
DECOOE 


To 
INTERRUPT 
HANOLEAR 


INST Net 


LOADS MAR AND TREG WITH 
GRANCH ADDRESS 


SAVES PC INR, 
REQUESTS BUS FOR INST N+! 


REQUESTS BUS FOR INST Neo? 
READS INST Net 
“LOADS TREG*2 INTO PC, MAR 


REQUESTS BUS FOR INST Ne3 
READS INST Ne2 

LOADS PC +2 INTO PC, MAR 
LOADS INST N+1 INTO Z, AND IREG 
DECODES FOR INST Net 


R, = (PC). PC = (X2)+D 


LOADS INST Net INTOZ 
LOAOS SHIFT COUNT OF INST N 
INTO LOOP COUNTER 


EXECUTE 1 CYCLE OF SHIFT 
OR ROTATE OECREMENT 
LOOP COUNTER 


IF LOOP COUNTER « —1, 
GO TO NEXT INST 
OTHERWISE, REPEAT CURRENT INsT 


REQUESTS BUS FOR INST Ne3 
READS INST Ne2 

LOADS Z, WITH INST Not 
DECODES INST Ne} 


Figure 28. Shift and Rotate Instructions. 


REOS. HREQ 
2X, — MAR 
1OCNT — CTR 

SETIOR 


REQUESTS SUS FOR INPUT PORT 
HOLDS THE Bus 

LOACSS MAR Wil 10 AGORESS 
LOADS 10 COUNT INTO 

LOOP COUNTER 

TUANS ON IO READ 


HOLDS SUS 
DEC. LOG? COUNTER 
UNTL CTA -1 


HOLDS BUS 
TUANS OFF 10 READ 
READS 10 PORT CONTENT 


AEQUESTS BUS FOR INST Net 
LOADS 10 POAT CONTENT ovTO Al 
AE-FILLS PIPELINE FROM “STARTIC 


Figure 29. Input Instruction. 


REQB. HREO 
X,¢Zy— MAR 


SET (OW. HREO 
R,-— OREG 
1OCNT-— CTR 


RESET 1OW. HREQ 
JMP START 


TO START 


LOADS MAR WITH 

OUTFUT POAT NUMBER 
REQUESTS BUS FOR OUTPUT 
WOLDS THE BUS 


TURNS ON (0 WRITE 

HOLDS THE BUS 

LOADS OREO WITH AI 

LOADS LO CNT INTO LOOP COUNTER 


HOLDS THE BUS FOR 10 ADORESS 
DEC LOOP COUNTER 
UNTIL CTA 2 =1 


WRITES THE CONTENT OF ORE 
INTO OUTPUT PORT 

HOLOS THE BUS 

JUMPS TO" START 

TO REFILL PIPELINE 


Figure 30. Output Instruction. 


341 


342 


Upon an interrupt, the 16-Bit Computer finishes its current in- 
struction and jumps to microinstruction H#1FF. The interrupt 
handler works as follows: 


1. Current PSW is stored in DREG and SP = SP-2 (inst 
H#1FF). 

2. The content of PSW is written onto the stack In memory. PC = 
PC—4 to flush out the pipeline (Inst H#1F0). 

3. SP = SP-2 (Inst H#1F1). 


H#1F2). 

5. The content of the PC is written onto the stack in memory and 
the vector in the Am2914 is output to the interrupt vector 
PROM. A vector jump is made following thls instruction de- 
pending on the interrupt number (inst H#1F3). 


The content of the adjusted PC is written to the DREG (Inst 


6. The vector jump directs to 1 of 8 locations labelled INTg-INT,. 
For INT,-INTy, the first instruction disables interrupt in the 
Am2914 and forces new PC value into PC. INT» requires an 
extra instruction to clear the Am9519. The interrupt vector in 
the Am9519 is to be determined by the macro interrupt 
handler. 

7. This next instruction is the same as the START instruction. 
The previous instruction cannot jump to START directly be- 
cause the immediate operand uses the jump address field. 
The macroprogram resumes at the new PC value. 


The instructions implemented cover only a small portion of all 
possible instructions. Only 137 or 512 microinstructions are used. 
The rest of the instruction space could be used to vastly enhance 
the Instruction set such as byte operations, storage to storage 
instructions, etc. 
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erereces eres 
s osu nn 
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were 


| Sahehahaia as ~hehehaheiatas hehehehe -tetehehetetateteteta te 


NNO cree 
= 
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Oc ceovercccocccces cusevesvesoesoee 


AICRCPROSRAN FOR APO 15-3IT COMPUTER 
WRITTER BT STIVE CEEBS 9/78 
BEUISICH 1.2 32/15/78 


C0 coee coecece cecsccoeoevecsoreesoor® 


RESET SEQUESCE STARTS EWE 


BLD © WORD & CCWIBCL .ICEN,JNTDIS. & CHTLD BevP & DATAPATE & 
WEM.COST BEGB,,BAEQ,,PwORD & AP2904 & PCU.NOP & INP INIT 


RIQCESY BCS PCR insysucTIoOs & 


ALU TBCS,PASS & AD G WORD & OFT & CARRICTL & CowTBOL & 
BATAPATE .o050cLEMARy voooe & MEN.COGT REGD,«,,PVCAD & 
Anzyoa & BCB-SOP & CCHT 


BESTIST 93S FOR JaSTRUCTICN Mel. READ LusTRUCTION # 


ALO TUCS,PASS & AD & VORD & CET & CARRICTL & CCHTROL & 
EATAPATE o.042cLEPAR, perce & MEM.CONT BEQD,PREQ,,,mVORD & 
arzse4 & BoD EET & Coat 


ABGUEST BUS POR 
BCAD Zi RESISTEL 
BECCSE FOR Imstace 


429 THUS, PASS & Bad © CARRICTL & OFT & WORD & ConTROL & 
DATAPATS IIL, ,555,hEMOR, LIT poe5 & MEN.COMT BEQDABEQ,,.MVORD & 
anzoe4 & PCO.BEIT @ JraP 


BESUEST BEIT lasTAccTION 

Alo T2U3,Pass & Asi t3 cuuarert & OST & WORD & ConTROL & 

ETeeaTE « G6 MEN.COMT REQE,MEEQ,.,.MVORD & 
+ 


wecccascce 


BA TIPE LasTaccricns 


we cccccccc ccccccccccccccccccccccescccoe 


oct ION as Ad ed ImTROCTION hel. 
Jastavetl 


ACAD BECISTER . ae a CC: mOaE 


22 81,22 Bie (22) 


ALU REZ,PASSR & AD & CARDICTL & OFT & VORD & 
EATAPATE Z21,,,5,,LD542..251..,, & MEM.CORT REQB,MAEQ,».MVORD & 
AMZ906 & COATPOL € PCU.SELT & JraP 


a an ccs csvz 
Ri = (ha) ¢ (R22) 


ALU RIES.ADD G AY G CAARTCTL & CET & vO—AD & 
DATAPATE 221,,,,,ohfPAB,, 211,,., & MEN.COBT RIQD,MAIQ,..MYORD & 
nzeee ,.12,00,85; TROL & PCU.REIT & JPaP 


seereece anis-us ar ter csvz 
Bi ,R2 Bre (Bs) = 2)” 


CTL COLI] & CET & WORD & 
T1oee. & MEN.CCHT 2LIB,AREQ,..mVORD & 


acra.cmm, & 


atu ars. S0B8 oa it ¢ 
Bavarata iit 


arzves .,32, mM, & COBTRCL & PCU.MEIT & IMAP 
and BISISTERS 4 ma CC: csvz 
BR B12 as © (31) asp (xz) 


ALU RL>,seD BADE Breet: G CEI G&G WORD G 
SATAPATN 221... AR, TIS,0,. & MEIN.COST Sean eee tea erwone ‘ 
anzee 02,00, 55, oCEN, © COmTROL & PCU.BSIT E 


OU BESISTEDS a 
on 21,82 a1 © (a1) on (2) 


ALy 225,08 & Pb cana cre G CET & vORDE 
ataracta Hts B Lowan, .201,,,., 6 MIN.CONT ECB AEDs ss MVCRD t 


ces csvz 


_ 


Anza ..82 oLOU2 CIN, & COBTROL & PCB 
COMPARE LOGICAL BECISTIRS an ccs csvz 
CLA Bs, Bz cc = BESuLY OF a - (22) 


COmTEeTs OF Ri ABD B2 ARE BOT AYPECTED 


TeTL COEQ] & CET & woAD & 
Ti ,ce- & MEM.COUT 2EQD,MBEQ., MVORD & 
5m, & COMTROL & PCU.REIT & JPAP 


aly T9cS,5CPR 6 Ave Cc 
DATAPATH 220.5455 6L0M 
AM25€6 .,82,10,83,80 


eeec 


veut 


aoe 


ees 


0037 


(i weet we eres come eces NA peer eres 


NN ee SN ere es 


be oe weet ee 


Reem NN 


LH H HHA HNN Been HNN 


SA grt ete ss as 


bo 


Parties OB BEsIsTees - 8 e¢s csvz 


Ton 2 Ri © (hi) OR an 
ALO DIC,ELCR & AD & CARRICTL © ORT & WORD & 


EATAPATH I20,,,..,L5¢42,, 211,00, & MEM.CObT REQD,PREQ,, mVCRD & 
AM2666 ,,32,EC,EBS,OMR,CEN, & COMTRCL & PCU.SELT & JMOP 


Peceneccescesccsucccscococcessccecease 


AZ TIPE imstaccrioss 


an) 3 cc: aces 


p se 
Ba, 32(B) Ba ((32) © Dd) 
ALU TDUS,UDD & DAD & CARATCTL & OLY & WORD & CoMTROL & ETDS 
BATAPATE 4.,,,TPAR,LOMAB,,,EN20,5, & MEM.CONT RECD.MAEQ,,.PVOED & 
AnzG04 & PCULROP G ISB FETCHOP 


ALY AIS,PASSR G DAD & CARBYCTL & CLT & VORD & COKTROL & 
Darapat oo oLDPUah, 211, 2920,,, & MEM. COBT REQB,AREQ,..MVOBD & 
anzged & bCULTIT & ? 


* 2 se 
(zz) ¢ BD © (Ra) 


TCTL & OFT & VORD & CCATACL & BTB EE 
ooEN20,,, & MEN.COMT REQ, MAEQ,,,PUCED & 


STCAR ar Cs BOSE 
ST 21,32(D) 
atv rats, 


EATAPATH ,.,,.TRAR, LED 
AMZ626 & PCU.ROP & CONT 


ALU T2US,PASS & AD G CORRICTL & OFT & VOaD & CONTROL & 


PATAPATH 221, oo & MIM.CONT BEGB,,..MVORD & 


aracaa & PCU. 


ALJ IBUS, PASS & ad & C 
EATAFATP ...,..L0m 
ancoes 6 PCULRIIT & Jet int 


TCTL & OFT & WORD & CONTROL & 
& MEP.COWT RLQD,PREQ, .URITE, WORD 6 


ADD epee ax cer csvz 
ADD B1,22(E) «= BA (R21) © (Caz) © BD 


ALU THUS,ADD & CAD G CARRICTL & OFT & WORD & CONTROL & UTP 
DATAPATF 4 y45eTPAP,LOMAR,, .LN70,4, & PEN.CORT BEQbOnREQs seP¥CRS 6 
An2994 & PCULNCF & JSD TETCROP 


STL & OFT & vOa> & CONTROL & 
oIN26,,, G& MEM.CCHT AEQB,MREQ,,,MVORD & 
CUM, & PCULMEZT & JMAP 


susrascr 38 ar cer cs8s 


SUS 22,2210) = Ba © (PD = ((xz) © 2) 


ALU TDUS,ADD & DAP & CARETCTE & OLT & VORD & CCATAOL 6 RTD 
DATAPATE ,.,,,TRAR,LDPUR,, ,LXZ2,,, & MEN.CORT REQD,MREQ,,.nVORS & 
arzgea & bcbl scr a’ ssa ritcuor 


ALU REC,SUPR & DAD & CARRYICTL COEQ] & OFT & WORD & COMTROL & 
DATAPATE sues oLL™a AR, .ZIT,ENZI,,, & PLM.COWT BLIP MALO,» -MVORD 6 


AN2¢0@4 ,,EZ.EC,ES,LOVR, CDI, & PCU.MEXT & JMAP 
a cer csvz 


and “ 
Pe tretiid) Bi © (ma) and ((22) © BD) 


AL” TOUS,ADB & DAP & CARATCTL & CET & VORD & CowrTROL & ATDE 
DATAPATE BNLOy 4, & MDT.COMT REQE,MAEQ,,.nvCED & 
arases 6 

ALU BIC, AND G DAD & CARAICTL & OST C WORD © CONTROL © 

DATAPATE oooocoLDMAR, 213, ENZO,,, & MEN.COMT REQD,MREQ,, oMVORD & 
AM2966 ,,EZ,EC,ES,£OUR,CIM, & PCU.MEIT & JAP 


-- CCs csvz 


on C4 
© 21,22(D) By © (ma) on (2) © BD) 


ALU YBUS,ADD & DAB G CARATCTL & OET & YORD & COnTROL & ATDE 
DATAPATZ o44, THAR, LDPAR,,.ENZO,,, & MEV.CONT REQD,MREQ,,,MVORD & 


Anzooe & PCU.SOP & ISB FETCROP 


‘ CAnaresE & OZT & WORD & CONTROL & 
AAR, -ZIL,ENZO,,, & MEV-COMT REQB,MREQ,_MVORE ‘ 


AM2666 ,,82,EC,ES, EOWA, CIM, oP wer & InP 
ConmpaRe ar ces csvz 
cmp 21,32(D) cc @ RESULT OF ap = ((12) © db) 

TEE COKTIATS OF BL AMD (X2) © D ARE wOT AFTECTES 
alo r3Us, DAD & CABATCTL & OFT & WORD & CONTROL & BIDE 
DararaTe TMAR,LDMAR,..INIO,,, & MEN.CONT AEQD,MEIQ,, mvORD 6 
anzoe4 © Pe & JSB FETCHOP 


EEE EEEEEERSSIEEEREEEEEEET 


Figure 31. Microprogram for 16-Bit Computer. 
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we et 
“SE 20 22 eo ee ws oe 
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NN wen ne: 
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& CAARYCTL COEQS & OST & WORD & CosyBOL & 
T11, THI8,,. & MDV.COMT REQD,MBEQ,,,.mVOED & 
oCEN, & PCU.MEST & JnaP 


OUTINE TC FETCH CPERAND FROM MEMOIT 


ovccvassecoces: 


STCECP:ALU TD03,PA5SR & DAB & CABRICTL & OEY & WORD & COBTROL & 


DATAPATE 221, 
AF2906 & PCU 


o 6 MBN.Conr 


DB MREQ, MVORD & 


LOAD IAMEDI ATE a ast Cs woes 
Lt a1,03 eo | 
ato PASSR & DAB & CARATCRL 6 ORT & vORD G ConTROL & 


Dar LOMAR « & NUN.COmT REQD,MAZQ.».MVORD © 
Anz) TG IMP Stant2 

AOS JED are oe 3st ces Cavz 

TU A1,o3 Bie Ri AnD DBI 


Alu AEC,am> & ee? 6 saparere © O8T & VORD G ConraoL © 


Bavara © MDN. coat Brg, rasa 
anzvea Teen, U'peucagiy 6 MP stants! 

On tmneptars ry ast 1 C89; 
OF 81,51 Ato Ri cn oF eeneaye 


ALO REG.OR & DAB & CARRICTL & ORT & WORD & CONTROL & 
DaTaPacve oLOMAR,.,2620,,, & MDV.CORT EGE AIEQ, o-MVORD & 
AM2006 ,.E2,EC.5S,50UR,CEN, & PCU.BEZT & UMP STARTZ 


ESCLUSIVE OR SrmgDiatE Lad asi Cr cs¥E 
Poe Tit! Bi - Qi OR DE 


ALC RES, EXON © DAD & CAREYCTL © ORY & VORD & CCWTROL 
Davarare ceeeee SEAN oo sO Tee & AEN.COMT RIQB,AREQ,,.MVORD & 
AN2064 ,,BZ, EC, £3, EOUR,CEN, & PCO.MEZT & JNP STARTZ 


IPED ATE oa ast Ces csvz 
al 2s,o1 Br © Br Dr 


ALJ 3ES.ADD & DAB & CAURYCTL & OFT G WORD & CONTROL & 
paz? seegs LDCUR,,,ENZB,y. & MEM.CONT REQD,MRIQ., sVORD & 
ardcaa £2 ,2C lS tovdscon, i’ PcU.mgry & InP Stiat2 


Hie ei Step DIAzE, LD} RSI ccs Csvz 
tI8 eRM- n 


peas ee Dab Caanyert COEQS & OFT & WORD & CouTe 
arty 20,05 & MEM.COMT AZQB, FREQ, 
G'Pcu.mErT G InP SzAaTZ 
“imEsLate 


9 ast ccs cots 
CC © RBSULT OF R1 - DI 
TEX CORTEPYT OF Ai IS ROT AVPECTED 


AL0 TBUS,SCER & Ho 4 CARRICTL COEQL & OBY & VOD & COMTROL & 
DATAPATE ooEHI0,,, & MEM.CORT BEQB,FBIQ,,MVORD & 
arzses ..52, eras: Ove Lemme C’bcv.agst G snp starz 


bab 6 


eooeocore 


BRABCE IGSTRUCTICNS 


Orecceces 


SEARCH UNCONDITIONAL A} a CC: wosEe 


Br2(D) Pom (32) 6B 
ALC T3US,ADD & DAB G CABRICTL & ORT & WORD & CONTROLS RIBS 


DATAPATE , £uTRE o LEP AR eo, ‘ 

AIM.COMT BEQB, omvoar & 964 & PCU.TH2 & IMP STARTZ 

ALU TBUS,PASS & BB & WCRD & OFT & COMTAOL & 

DATAPATE J INTPES, 94. .LDMUBsoeeoe & MEN. cour ite) oneteeamr ne t 
Anz2904 & PCO. TR2'C'Snp STARTS * 


BAANCH ON SOnDET EON 2 eH ax 
aC C.32(d) ita Spe 24, PC © (x2) ° D 
BLSE PC © (PC) © 2 


ALO TDCS,ADD & DAP & CARRICTL C OZT & WORD & ConTROL & RTD & 
DATAPATA’, ,LDTBEG,,,YRUN,LO“AR, , 1MZO,,,9RIEM & TEST 57 6 
MEN.CONT RTQD,ARED,,.mNCRD & 

WZEE4 OLCTe coos & PCU-MOP & CIP BRE 


BRANCD WOT AEEDED ’ 


ALU TBUS.PASS G& AB G WORD & OEY & CONTROL & 
TATAPATE 221,, 
An29a6 & PCU.R: 


BEANCH AnD Link 
BAL M1, 12(B) 


ALU YBUS,ADD & DADC 
DATAPATH ,, LDTAEC,,.THAD,LOMAR,., ENZOe, 
An2034 & PCU.ROP & CONT 


ALU RIC, PASS G DAB & CARBICTL & WORD & CONTROL & 
DATAPATH seveePCUYecvcoee & MEN-CONT REQB.5,.MVOND & 
4n2566 6 PCU.NOP G JMP BEL } 


Baance © RECISTIA es mam... 
DALR RL RZ RL = (PC)..PC © (R2) 


ALU YBLS,PASy— & DAB CCARBICTL & OFT & WORD & COBrROL & 
DATAPATH , .LOTREG », pTMAR,LDMAR,, SENZB egy & AEM.CONT ooo oP WORD & 
anzses & Péu.nop € ‘Jnr Bila 


COs none 


B1 © Pc + 2, PC ater) M3) 


CARATCTL & CEY G WORD G CCnTBOL G RTD G 
G MEN.COMT 45, .MVORD & 
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BLANCHE BESISTES atwit 
ua 21 we tu eR as, eee 
ALD Y80S,PASS & DAB & CARBTCTL & CET & ¥CED & Cont 


EATAPATE ,LOTRE.,.TRAR, LEMAR eyey & MEN.COST EEQ! 


arose 6 bev.nor Gon? 31 sks, : 


Shiv? jusvagcrioes 


SULPY LEVY ARITENETIC GB st Ces es 
Sia 21,CaT Q1 © Serrt (Bt) Litwarte nt eet PLACES 


ALO THUS,PASS & AB & VOD & OEY & CARBICTL & COTECL & 
RTCATID, & MIN.COMT ,,,,PVOED & 


ALO SUR, PASS & AD G WCRD G OST & CaRRTCTE & COMTDOL & CHL) HeTa & 
DATAPATH & MBN.COBT ,,,,mUOED & 
AA2064 SUIYYEN, ,E2,EC £3 LOVE, CEN, & PCU.RGP & BPCT § 


ALU TUCS.PASS & AB . WORD & GET & CaRATCTL & CCwrtCL & 
Batape™e , olor as aE Ib es ob © ABN.CCUT REQD,FAIQ,, .MVOED & 


anzged 6 PCU.mET & di 
SUPT Lerer Locicar 331 cc: Cs¥E 
SLL 21,CHP Bie sabrr tai) tert Loc ical cet rtiacts 


ALU TDUS.PASS & AD G WCAD & OST & CabatcTL & Ceeriot © 
DATAPATE 221, cecovccc ces MTCUTEM, & MEN.CONT 45 ,.M¥CRD & 
arzpes 6 PCU.SOP 6 LOCT 


ALD LUM. Pat3 6 43 ¢ VORD ¢ OFT 6 CANMIETL & COuTseL © CrTLD Bere & 
DATsPays & MEN.CCHT ,,,.pwORD & 
ANZ966 SULVTEN, .£2,55,85, 2072, CIM, 6 \PCU.ROP & EPCT § 


ALD YBUS,PASS & AB GC VOED EG on ‘ (CaRATCTL & fowrRst & 
3aTaPatE Loran. . j-COWT REQD,FREQ,,.MVORD E 


ares & 
SEITT BiCeT AB rersric ec: core 
Sha 21,Cur Bi © Sutst (B8) tear uitmatic Cet PLacrs 


ALO TEUS, PASS & AB & WOLD & OFT & Laaasesi & CesrBcl & 
Davarace 221, coe  SBTCHTEM, & AEN.COMT ,,,,MVCRD & 

wages 6 Peco. woct 

Ala 412,PaSS © 33 & wead & ae G& CARSTCTL & CowTBOL & Curd nets & 


EaTaPata & AEN.CCOT .,,,mWORD 
AAZ664 SELITED, ,.22,EC,23, Tord,cEn, G peu.sor © BPCT § 
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Figure 31. Microprogram for 16-Bit Computer (Cont.) 
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Figure 31. Microprogram for 16-Bit Computer (Cont) 


MICROCODE TRANSLATION , ; 


It is offen convenient for the microprogrammer to assign ml- 
croword fields such that they occupy positions that differ from 
those in the actual hardware implementation. This is often the 
case when the microprogrammer, for convenience, allocates bits 
according to the functions to be performed and then needs to 
translate the object code produced by AMDASM*K to be consis- 
tent with the hardware microprogram memory design. 


There is another instance where the ability to shift bit assignment 
is important to the engineer. As a given product evolves, bits may 
be added or deleted from the original microword format. When 
this occurs, a mapping function is desired to minimize hardware 
changes. : 

The program in SYSTEM/29® that performs such a mapping 
function is called AMSCRM. The AMSCRM maps the output of 
AMDASM (logical bit pattern) into the bit pattern thatis consistent 
with the 16-bit computer hardware. A table of the logical to physi- 
cal mapping is shown in Table 3. 


ENGINEERING MODEL AND MACROCODE 

With the proper tools — designing, microprogramming, prototyp- 
ing, and checking out a new computer design is not overly dif- 
ficul. The major tools used for the high- speed 16-bit design 
described in this application note was System 29'"), System 29 Is 


_a soltware driven hardware prototyping system which allows 


microprogramming, hardware desigr/checkout, and macropro- 
gramming (programming in the language of the target machine) 
to occur simultaneously. At the point where the design is reason- 
ably rigid, and the hardware is mostly fabricated, System 29 
allows the engineer to create “instant” microprograms to check 
out the new computers’ intemal data paths. Microprogram 
soltware support features of System 29 also aliow the engineer to 
single cycle, single instruction step, instruction trace, and trap on 
pre-specified events coming true. Simultaneously with this initial 
internal check-out, the microcode for some very simple machine 
instruction should be wniten (i.¢., load register, add register, or 
register, etc.). The next step is to check out the main memory 
paths with load and store instructions. At this point, a reasonable 
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Table 3. 


BIT ASSIGNMENT FOR 16-337 CCHPUTER 


BID POSITION MpErowie © DESCRIPTION 


st. AEG. FIELD 2 TO B PORT OF ANzge3 
SPARE 
SPAKE 


221 Loa 2 BEC. INTO ZI Kx. 
cCEN © —- AN2910 CONDITON CODE ENARLE 


AM2963 ALU COATROL BITS 


s WORD MODE = 6, BITE MODE = 1 

he EMAPEL & LATCH ON AM2ca3 

® EWABLE Y OUTPUT ON An2925 

e ENABLE DB LATCR ON AMNZ503 
INSTRUCTION LINES YOR AM29@5 


DATAPATE BITS 
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LDTEIG LOAD TRANSPER RIC. 

EnctE 2 I-RIC ENABLE COUNTER 

Inc I-HEG INC=1/DEC*0 

PCcoOcD PCU TRANSFER CBIP DISABLE 
PCUT PCU TRANSFER TO T-RUS é 
LDMAE LOAD MEMORY ADDRESS REGISTER 
LDD LoaD D-RECISTER 

zit LOAD Z1 INTO I RECISTER 

EN2@ ENABLE 2@ BECISTER 70 Da BUS 
PS ENABLE PSW REGISTER TO DA RUS 
SHTCNEN SHIFT COUNT AN291@ ADDRESS 
BAIEW « BRANCH INSTRUCTION EMABLE 


AMZO@14 PAOCDAM CONTROL UNIT 


67 PCoI? Y T0 B-2AM © 1 (DEFAULT), F T0 Q-REG = @ 
66 Pculs ADD = 1 (DEFAULT), SUB = @ 

65-63 PCUI2-8 PCU SOURCE CONTROL ‘ 

62-68 PCUa2-8 PCU A-RAM SELECT 

59-57 = PCUB2-@ PC B-RAM SELECT 


MEMORT CONTROL 


RFQ2 RUS REQUEST 

MREQ MEMORY REQUEST 

BALQ ROLD REQUIST 

WRITE MEMORT READ = © (DEFAULT), MEMORT WRITE © 1 
MWORD MEMORY RITE OP = @ (DEFAULT), MEMORY WORD OP = 2 


CONTROL BIT STRORES 


Imap e IMARLE IMMEDIATE FPILLD TO DA BUS 

2On I-REG ENABLE = @ (DEFAULT), ROM ENABLE = 1 
10ge 1/O CONTROL RECISTER ENARLE 

IxTDIS AP2Q914 INTERRUPTS DISABLE 

INvkI DI AN2914 INSTBUCTION EWARLE 

SEITEN AN2904 SHIFT BNARLE 


GENZBAL PURPOSE CONTROL BITS 


CuTLa7-6 BITS TO ME STRORED BT CONTROL STBORES 
AN2994 STATUS RECISTER CONTROL BITS 


orr OUTPUT EMARLE OF CONDTIOMAL TIST 

a2 » ENABLE ZERO PLAC UPDATE 

te EMABLE CARRY PFLAG UPDATE 

rs IMADLE SIGN FLAG UPDATE 

cova EMANLE OVERYLOW FLAC UPDATE 

cm EMADLE MACHINE STATUS REGISTER 
cru, TMABLE MICROPROCARM STATUS REGISTER 
112 AN2984 112 CARRY OUT CONTROL 

11 AM2004 CARRT OUT CONTROL 


TIST BITS 


TISTS-3 AM2904 TEST BITS 
22-26 = TEST2-8 AMN2964 & AN2SLS25 JEST BITS 


ANZ91@ SEQUENCE CONTROL 

wacs-@ AMN2019 NEIT ADDRESS GONTROL 

MEET MICEO ADDRESS OR IMMEDIATE PIELD 

"15-8 SBAQED FIELD FOR wXIT ADDRESS On Iemp 
END‘ - 


instruction sub-set should be microprogrammed (a phase 1 in- 
struction set) that will allow a simple monitor to be written in the 
target machines’s language. This monitor should run on the 
target machine and provide commands for: memory display, 
memory store and jump to memory location. The phase 1 Instruc- 
tion set and simple monitor now provides the basic foundation for 
completing the full computer design. 


The standard System 29 configuration provides automatically for 
microcode and hardware development In order to efficiently 
develop and implement the target machine's software, a target 
machine assembler and a mechanism for loading the machine's 
main memory must be provided. System 29 uses an AM9080A 
microprocessor, dual floppy disks, and a full function disk 
Operating system to support microprogrammed hardware and 
firmware development. The Am90B0A microprocessor can ad- 
dress 64k bytes of memory. The disk operating system uses only 
the first 32k bytes and the remaining 32k is used to memory map 
(page) functions from the hardware development side. Through 
this mechanism, the designer has the ability to directly load and 
manipulate microprograms, monitor hardware functions, etc. 
There are extra enable lines from the page register which allow 
the System 29 user to map other functions into the support 
Processors upper 32k of memory. 


The main memory of this 16-bit high-speed computer design was 
Mapped into the support processors upper 32k via one of the 
unused page register enable lines. Besides the normal 16-bit 
intertace, a simple 8-bit interface was added to the main memory 
thus making it a simple two port memory. When the 16-bit com- 
puter is halted (via a System 29 command) location 0 of 16-bit 
main memory would be addressed as location 8000 hex of Sys- 
tem 29 support processor memory. Location 1 would be 8001, 2 
would be 8002. etc. This affected a mechanical link between the 
16-bit prototype design and System 29. 


In order to efficiently write a reasonably complex piece of 
software (such as a simple monitor), an assembler for the target 
instruction set is needed. Since this 16-bit computer design is not 
exactly like any other 16-bit computer, ready to run software tools 
are not available. A macro assembler is available as an optional 
enhancement to the System 29 software base. Even though this 
macro assembler is for programming in AM9080A assembly lan- 
guage, there is a user installable patch which will disable all of the 
AmS080A operation codes (Figure 32). With this patch installed, 
the user may now write a macro library defining the target 
machine's instruction set. It is not necessary to code the entire 
instruction set, as the first level of programming for the new 
machine (simple monitor, etc.) will be using only the phase 1 
instruction set. A complete macro library of the AMD high- 
speed 16-bit computer phase 1 instruction set is contained in 
Appendix B. 


Now that the tools are in place, it is relatively simple to code and 
implement a simple monitor for the target machine. Appendix C 
contains the complete simple monitor listing for lhe AMD high- 
speed 16-bit computer. Only the phase 1 instruction set was used 
which does not include byte instruction, call and retum instruc- 
tions, stack instructions, any special instructions, etc. This simple 
monitor understands three commands: Display (D), Store (S), 
and Jump (J). Typing D followed by an address value will display 
256 bytes of main memory beginning on the address given 
(rounded back to the nearest eight word boundary). Typing an S 
followed by an address, followed by data, will store the data 
consecutively, on a nibble basis beginning at the grven address. 
Typing in J followed by an address will cause the processor to 
begin execution at the main memory location given by the ad- 
dress. Commands, addresses, and data must be separated by at 
feast one delimiter (space, comma, or penod). 


The change file shown below can be integrated into MAC to produce a new program, which we will call 
MAC29. The MAC29 program will not recognize 8080 mnemonics, but will recognize all the MAC pseudo 


operators and arithmetic functions. 


? 
3 MACRO ASSEMBLER 
; 


EQU 25 
PT EQU 26 
TAREA EQU 25618 


24444 
TAREA 


ORG 


C36125 JMP 


ORG TAREA 


"MAC™ CHANGES TO DISABLE 8@898 OPCODES. 


780988 REGISTER NAME 
7PSEUDO OPCODE TYPE 
7FREE AREA IN TOKEN MODULE 


sOVERLAY INX H MOV B,M RET 


TYPE IS IN THE ACCUMULATOR 


CPI RT 

Jc TY PEOK 
cer PT 

RNZ 


;BELOW RT IF ARITH OP 


7PSEUDO OP? 
;RETURN WITH NON-ZERO FLAG 


’ OTHERWISE, PSEUDO OP OR ARITH OP 


TY PEOK: INX 
NOV B,M 
CMP A 
RET 


END 


# 
;SET ZERO FLAG 


Figure 32. Macro Assembier Disable Opcode Patch. 
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Atter writing the monitor, and putting it onto floppy disks via the 
System 29 editor, it must be assembled using the modified macro 
assember (descnbed earlier). The result of the assembly is a hex 
file which is suitable for loading into the 16-blt computer's main 
memory. This hex file 1s now toaded into support processor mem- 
ory beginning at location 8000 hex. As discussed previously, this 
1S mapped at location zero in the 16-ba computer's main memory. 
Assuming the microcode !s loaded and a terminalis connected to 
the 16-bit computer, the monitor in 16-bit main memory may now 
be executed. The complete System 29 session from editing and 
assembling the monitor to loading and executing it is given in 
Appendix D. 


SUMMARY 


As can be seen throughout these application notes, designing a 
high pertormance Bipolar microprocessor system is a straight- 
forward task. The Am2900 Famity ts ideally suited to provide 
building blocks for the various elements of the computer. These 
mctude the Computer Control Unit, the Central Processing Unit, 
the Program Control Unit, the Interrupt Structure and the various 
bus controls. Together, these elements allow the designer to 


build computers using the current state-of-the-art architecture 
with LSI building blocks. 


As technology improves, Advanced Micro Devices has been able 
to redesign these building blocks to offer increased performance. 
Thus, the Am2901 has evolved through an Am2901A, then an 
Am29016 and now an Am2901C is in the planning. In addition, the 
Am2903 offers additional architectural advantages and soon an 
Am29103 will provide additional speed and performance fea- 
tures. Similarly, the microprogram sequencer area began with the 
Am2909 and Am2911; then was followed by the larger Am2910. 
Soon, the Am2909A and Am2911A will provide higher speed in 
the microprogram sequencer area and will be followed by an 
Am2910A. ‘ 


Thus, the future for Bipolar LSI buitding blocks includes not only 
more advanced product designs offering higher levels of integra- 
tion and new functions for new architectures, but also offers 
higher performance versions of the already existing products. 
Advanced Micro Devices is committed to providing high perfor- 
mance Bipolar LSI circuits utilizing proven technology designed 
to operate over the full military operating range as well as the 
commercial operating range. As always, these products continue 
to meet the perlormance requirements of MIL-STD-883. 


APPENDIX A 
Complete Description of Instructions 


} 


The second operand is loaded into the general register specified 


by Ry. 


STORE : A t = . — 


The first operand specified by R, is stored at the location 
Specified by the second operand. 


The first operand is added to the second operand and replaces 
the first operand. 


ADD WITH CARRY 


The first operand (16 bits) with carry is added to the second 
operand and replaces the first operand. 
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The second operand is subtracted from the first operand and 
replaces the first operand. 7 =i 


SUBTRACT WiTH CARRY 


The second operand (16 bits) with carry is subtracted from the 4 
first operand and replaces the first operand. 


RX, RSI 


The AND of the first operand and the second operand replaces 
the first operand. 


RX, RSI 


The OR of the first operand and the second operand replaces the 
first operand. 
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x 
fe) 


R 


RX. RSI 


The logical difference of the first operand and the second operand 
replaces the first operand. 


TEST IMMEDIATE 


2 
x 
Bs] 
2 


The first operand and the second operand are logically ANDed. ~ 


The contents of R, and X2 are unchanged. 


The first operand 1s algebraically compared with the second 


* operand. The result is indicated by the condition code. 


COMPARE LOGICAL 


The first operand 1s compared logically to the second operand. 
The result is indicated by the condition code. 


The first operand (R,; - 1) 1s miihipeed by the second operand 
and the 32-bit product ts contained in R, and R, ° 1 registers. Ry 
must be an even address. The sign of the product is determined 
by the rules of algebra. 


MULTIPLY UNSIGNED 


R 


=< 


The first operand (R, + 1) is multiplied by the second operand 
and the 32-bit product is contained in R, and (R, + 1). Ry must be 
even. 


RR 


The 8-bit byte stored in the low order byte of the second operand 
location is stored in the low order byte of Ry. The high order byte 
of the R, is set to zero. 


‘INSERT CHARACTER 


RX, RSI 


The byte al the second operand location is loaded into the low 
order byte of R, without spat the contents of the high order 
byte of R;. a ies 


STORE CHARACTER 


The !east significant byte of the first operand Is stored tn the 
location specilied by the second operand. The other byte of the 
second location is unchanged. 


EXCHANGE BYTE 


The bytes specified by the first and second operands are ex- 
changed. When the operand specifies a register (i.e. Ry, Az) only 
the low order byte is exchanged. : 


2D 
x< 


The two bytes of the second operand are swapped and loaded 
into the register specilied by the first operand. 


COMPARE LOGICAL BYTE 


RR, RS 
i g3 - ._. RX, RSI 


The low order byte of the first and second operands are com- 
Pared. The result is indicated in the condition code. . 


AND BYTE 


Pp] 
2 
2D 
” 
i 
1 
' 


. RX, ASI 
The AND of the low order bytes specilied by the first second 


Operands replace the first operard low order byte. The high order 
byte of Ai, is set to zeros. 


OR BYTE 


RX, RSI 


The OR of the low order bytes specified by the first and second 
operands replace the first operand low order byte. The high order 
byte of R, is set to zero. = 


The XOR ol the low order bytes specified by the first and second 
operands replace the first operand low order byte. The high order 
byte of R, is set to zero. 


LOAD PROGRAM STATUS WORD 


A 32-bit new PSW is loaded from the memory location specified 
by the second operand as the current PSW. 


EXCHANGE PROGRAM STATUS 
RR 


eco] ar 


PSW (0:15) — (Ry) 
R2 -+ PSW (0:15) 


STORE PROGRAM STATUS WOAD 


The 32-bit PSW is stored at the locat‘on specilied by the second 
operand. 
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SUPERVISOR CALL 


OLD PSW — [(X2) + d] 
[(X2) + d] + 4-+ NEW PSW 


SET, CLR, COMPLEMENT, TEST BIT PSW 


The condition flags in the current PSW are set, cleared, com- 
plemented, or tested. N defines the bit(s) to be affected or tested. 


(2) 


ALL 


Jump to the memory location specified by the second operand 
and push PSW (16:31) onto stack. ; 


POP STACK 
STACK — PSW (16:31) 


PUSH 


STACK + R, THRU Rp 


LOAD STACK POINTER 


* LOAD STACK LIMIT LOWER 


LOAD STACK LIMIT UPPER 


STORE STACK POINTER 
STORE STACK LIMIT LOWER 
STORE STACK LIMIT UPPER 


The stack point, stack limit lower or upper is read from or written 
into the address defined by the second operand. 


: Re 
ADDRESS , 


The addresses specified by R, + 1 and Rz define two tables, Ry 
+ 1 address is the top location of a table to be translated, Rz 
address the first location of the translation table. The value (one 
byte) pointed to be the A, + 1 address is indexed by (added to) 
ihe address value of R;z to find the translation code. This transla- 
tion code replaces the value pointed 10 by the R, + 1 address. 
After one byte 1s translated, the length is decremented and the 
address of Ry + 1 incremented and the instruction repeated, until 
the length equals zero. This instruction is interruptable. If this 
instruction is interrupted, the PC is left pointing to this instruction 
So that thts instruction can be resumed after the interrupt service 
is complete. 


TRANSLATE AND TEST an 


R, 


ADDRESS -— 


A, c ‘ 


ADDRESS 


This instruction proceeds like translate except that the bytes of 
the first operand (defined by R,) are not changed in storage. 
When the bytes of the translate table (R2) the instruction pro- 
ceeds to the next byte of the first operand. tf the byte of the 
transiate table is not zero, the instruction is halted with the ad- 
dress pointed to last in the translate table held in register 1. _ 


MOVE LONG meatateniomne 9mm cigre—eaeneeaniat 
: RA bnsi ap tl 

| op] ns | me | a ee ae 
Ry ts ae az ' 


LENGTH 


ADDRESS ~ 
Ro > 


ee 
ADORESS jv tay 4 


Moves bytes defined by R, to R2. Both adresses incremented 
after each transfer, This instruction is interruptable. 
H ! 


COMPARE LONG —--—-— 
a8 . RR 


Ry 


ADDRESS 


R3— 


ADDRESS 


Compares the first operand against the second operand. The 
fength is decremented and the address incremented alter each 
Compare. When Jength = zero of the bytes compared are not 
equal, the instruction is halted. 


EXECUTE 


The upper 16 bits of the instruction at the second operand is 
‘OR’ed with R, and executed. 


OECIMAL AOD 


Nibbles in operand 1 and operand 2 are added. The result 1s 
placed in operand one. 


DECIMAL SUBTRACT 


Nibbles in operand 2 are subtracted from nibbles in operand 1 
and the result is placed in operand 1. 


OECREMENT INDEXES 


RR - 
R,-17R, - 
R2- 1.7 Re 


One is subtracted from R, and the result placed back into R;. One 
is subtracted from Rz and the result placed back into R2. R, and 
Riz may specity the same register with will effectively subtract two 
from that register. 


SHIFT RIGHT ARITHMETIC 
SHIFT RIGHT DOUBLE ARITHMETIC 


The contents of R, for single shifts and A,, A, + 1 for double 
shifts are shifted the number of places specitied by the second 
operand. The sign bitis unchanged. Bits shifted in are set equalto 
the sign bit. Bits shitted out are shifted through the carry bit. 


D 
= 
D 
2) 
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ROTATE RIGHT 
ROTATE RIGHT DOUBLE 


RX, RSI 


The contents of R, for single shifts and R,, R, + 1 for double 
shifts are rotated right the number of places specified by the 
second operand. 


SHIFT LEFT ARITHMETIC 
SHIFT LEFT DOUBLE ARITHMETIC 


RX, RSI 


The contents of R, for single shifts and R; + 1 for double shifts 
are shifted let the number of places specified by the second 
operand. The high order bit (sign bit) of the register a register pair 
is unatfected by the shift. Low order bits are filled with zeros. Ifa 
bit untike the sign bit is shitted out of the position adjacent to the 
sign bit, the overflow flag is set. 


ROTATE LEFT 


The contents of A, for single shifts and R,, A, + 1 for double 
shifts are rotated left, the number of places specified by the 
second operand. 


SHIFT RIGHT LOGICAL 
SHIFT RIGHT DOUBLE LOGICAL 


RX, RS! 


The contents of R, for single shitts and R, + 1 for double shifts 
are shifted right the number of places specified by the second 
operand. High order bits shitted in are zeros, low order bits shifted 
out are shifted through the carry bil 


SHIFT LEFT LOGICAL 
SHIFT LEFT DOUBLE LOGICAL 


The contents of R, for single shifts and R,, AR; + 1 for double 
shiftz are shifted left the number of positions specified by the 
second operand. High order bits shifted out are shifted through 
the carry bit. Zeros are shifted in. R, for double shifis must be 
even 


{INPUT WORD 
RR 


RX 


One 16-bit word of data is read into the first operand from the 
device which is addressed by the contents of the second 
operand. eee ee mee mae 


INPUT BYTE 


AX 


n= 


One byte of data is read into the low order 8 bits of the first 
operand trom the device which is addressed by the contents of 
the second operand. i F 


RX 


The 16 bits of R, Is sent to the device which is addressed by the 
contents of the second operand. 


AX 


The low order 8 bits of R, is sent to the device which is addressed 
by the contents of the second operand. 


RX 


Unconditionally branch to the location specified by the second 
operand. The first operand is not used. 


BRANCH ON CONDITION 


RX 


Branch to the location specified by the second operand if the 
>ondition code specified in the first operand postion is equal to the 
current PSW status bits. 


Condition codes are: 


Carry =B (Sign=0) 

No Carry =A Minus =F 
Zero = 5 (Sign=1) 
Not Zero =4 1'sComp> =9 
2'sComp> =0 1's Comp< =8 
2'sComp< = 3 1'sComp> =C 
2'sComp> = 2 1'sComp< =D 
2's Comp< =1 _. Overllow =7 
Plus =E Not Overflow =6 

St 


x 


BAL ; A 


The address of the next sequential instruction is saved in R;, and 
an unconditional branch to the jump address Is taken. 


BRANCH ON INDEX 


BXH HIGH RX 
BXLE LOW OR EQUAL "RX 


R, 1s incremented by the value in A, + 1, and logically compared 
to the index limit held in R, +2. 

INDEX HIGH 

(Ay) + (Ry + 1) > (Ay) 

(Ry) : (Ry + 2) 

IF (Ry) > (Ry + 2) THEN d + (X2) -» PSW (1631) 


IF (Ry) = (Ry + 2) THEN PSW (16:31) + 2-+ PSW (16:31) 
INDEX LOW OR EQUAL 


(Fy) + (Ry + 1) > (Ry) 

(R;) : (Ry + 2) 

(Ry) < (Ry + 2) THEN d + (X)-» PSW (16331) 

IF (R,) > (Ry + 2) THEN PSW (16:31) + 2 > PSW. (16:31) 
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APPENDIX B 
: Seeececercocecesoeesoseeeeeetess : DB 158, Ri°1eBeR2 a  iiaaecoeae 
H " { 
3 MACRO DEFINITIONS FOR HICRO/29 ; ' < 
3 i m EXCLUSIVE OR REGISTERS |; 3? 
3 Seeesceseoseseseseseseseeeeeeees — . 3 Se 7 = 
3 m MACRO B1,R2 
i DEFINITIONS YOR CPU REGISTIRS 3 178, R1°16neR2 
NDA 
Re stte : - ener a a a —= - 
Rl SIT 1 } 
a2 str 2 : neneccesscncccucscncnazanses: 
Vay sit 3 H | ‘ 
uM str $ << - i “—~--RX TYPE INSTRUCTIONS — +---—— 
RG Str 6 3 enenecn ceneccunssecennesnsasescucse 
R7 Sit 7 ; - ’ 
BS sits. ; LD LOAD HENORT sa 
RS SIT 9 13) MACRO R1,22,DI 
us set Hy BB SN Rici@Rerz, (Dr) SUR G,(DI) AND OFYH 
R12 SIT 12 ; 
ais sir 13 ; st STORE IM MEMORT c yo MB ae 
a ; MGMNGHIID : 
pst) $tt 15 St MACRO R1,12,D1 
3s sit H 2 . eda sRA*1086%2, (DI) sua. B.(D1) AND OFFA 
x2 Sit 2 t i |. 
x5 sit 3 ; anD ADD YROM MEMORY ¢ Sa 
157 sit 5 ADD MACRO B1,22, DI xh i eae 
- sir $ DB SAB. Ri9108622,(D1) SER 6,(DI1) AND OFFH 
WD 
18 Sit 8 eee salt r Ee Oe 
xe str B - ee yr i SUB SUBTRACT PROM MEMORY | ; ss 
i211 SET 11 , sus MACRO 81,2, DI toc | 90 
aie 3m a2 ae ‘ J } j { DE SBE, M1°18B+E2,(D1) SHR a,(01) AND OFFER 
ES ERD (eae: Ce eT a —_ 
m4 SIT 14 a ae : rae 
x15 Str 15 } " © up view mrnoay 16 ise sa me 
, 2 . 
3 PRESET CONDITION CODES " MACRO 21,22, DI 
be mes ain ee DB S4H,R1°10N+22, (D1) SER G,(D1) AND OFFER 
NC? SET eau ane CaRaYy ; 
2? SIT O55 mo 3 t) OR WITH HEMOAT 56 
nz? SET 048 iNoT ztR0 wu i ‘ 
cr? 32°S COMP GREATER THAN ° MACRO 21,12,DI , 
Lt SET O35 32°S COMP. LESS THAN DB SGH,Ri°16H0%2, (D1) SER A, wn AND OFYH ) 
CE? SIT 628 32°S COMP. GREATER THAW OR IQUAL 70 ENDH 
Lr? SIT 015 12°S COMP. LESS THAN OR EQUAL TO ; 
PL? SIT O18 sPLUS, SICN = 9 $ CHP > COMPARE WITH MEMORY . 55 
HI? SIT e7E iMINGS, SION © 1 - H 4 oe 
BI? SIT 098 $1°S COMP. BICHER : CNP MACRO R1,X2, D1 
1s? SIT eon 32°S COMP. LOWER OR SAME DB OSK,R1*10NeK2, (D1) SHR G,(DI) AND @FFE 
BS? SET eca 71°S COMP. RICHTER OR SANE 
Lo? SET Dk t1°S COMP. LOWER 
oV7 7a sOVERFLOW ' 
ni? SET G6 SNOT OTERFLOV ; seucecccsveceunscurseuceescucesssecessess 
; 3 . 
: SSS 8 ROS SSS SSS FES SSS SS ESSE SS SSeS 3 IMMEDIATE IWSTRUCTIONS 
; 
3 AA TYPE INSTRUCTIONS ; Secsennvesesenasccesenssusensccsenesccess 
H ; H 
3 Se enecasers cesesesecssaseasceoscss : LI LOAD IMMEDIATE 41 
; H 
; LE LOAD REGISTER 18 LI MACRO R1,12 
is nathot nice BE GTR RICAOR, (22) SUR 6,(I2) AND SYYA 
DB 165, Ri*10HoR2 ; 
- MDM . 1 NI AND IMMEDIATE 94 
3 al ADD REGISTIR u MI MACRO R1,12 . 
a eee are > Ba S40, +Ri®10R,(12) SHR 8,(12) AND OYTE 
DB 145, Ri®1e£+R2 ; 
‘ EXDM i 03 OR IMMEDIATE 96 
3 sR SUBYRACT RECISTER 1 or MACRO 21,12 
i. a : BO geu,Riston, (12) sm 8,(12) AND SYFE 
macho B1,2 
DB LER, W1°102¢R2 3 
ENDK ; Il, «-- RICLUSIVE OR IMMEDIATE 97 
: H 
3 7 | AND REGISTIES 4 II MACRO R1,12 
3 ip S78, »R1°10R, (12) SHR G,(12) AND SPPH 
aR MACRO R1,R2 NDA 
DB 14H, Ri°1@HeR2 5 
EADH ; al ADD IMMEDIATE. oA 
H 
; ORR OR REGISTERS 16 al macRO 21,12 
; DB OAH,Ri®10R,(12) SER @,(12) AND SPY 
ORR MACRO 21,22 ENDA . 
DE 168, R1°10HOR2 i 
zwpe ; s1 SUBTRACT IMMEDIATE on 
i ' 
- cm COMPARE LOGICAL REGISTERS 15 $1 MACRO B1,12 
H DB OBH,R1°1@8,(12) SHR &,(12) AND SFE 
cLR MACRO 21,22 ENDM 
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j / os 
H ct COMPARE IMMEDIATE 95 ; saL SHIT RIGHT LOGICAL 68 : 
3 
cl MACRO R112 SRL muceo BL, Ct 
DB oSa, nisden, (12) SHR 6,(12) AND OT7E ates DB GGE,R1°18Hs(CT=1) 
, BMDA Ex EnDK 
H eeneneananccegeacqonccnnsaacsccsnccessces ' SLL SHIT? LEY? LOcIcaL ao 
3 
; DUANCH AND CONITIONAL BRANCH INSTRUCTIONS SLL macho 21,CT 
3 ‘ ; DE 8S2,B1*10n+(CT-1) 
H wewceescsenensecenassecansssccosssssceses nee xNpn 
3 : . ’ ’ 
i aX UNCONDITIONAL REANCH 7 ; su SHIVT MIGHT ARITENETIC aA ‘ 
i 
ax MACRO 11,01 SA Macro B1,CT 
DB reR-Tieion, (DI) San @,(DI) AMD BYE DB GAf,Ri®ieue(CT-1) 
: ENDH Lapa 
H BC... CONDTIONAL BRANCH « oe i aaL ROTATE RIGBY aa 
’ 
ac MACRO CC,X2,DI RRL machO 21.CT 
DB 47H.CC*1GHex2,(DI) SBR 6,(DI) AND OFTH Da @a6H.B18108¢(C2-1) 
EXON P rNpm 
H . ‘ 
i DAL, BRANCH AND LINK 45 ; BLL ROTATE LEFT As 
Bal maCRO B1,12,DI RLL MAcRO R1,CT 
DB 455, Ri*16+X2,(D1) SER G,(D1) AND OPTE DB CAA ,RI°20N¢(CT-2) 
EaDM ENDM 
; i antes ba : <nhceicnlliahshcnilpainieicipannnnsiaeotal 
H t ; 
i BALR«- BRANCB AND LINE REGISTER os ; ia susttaetions pares 
oe PR GSH ARL AON SRD : escaseae seeeneeuussecesarers 
P ENDM 4 IN 1nP0T ; as 
: ° : 
; R MEANCB RIGISTER UNCONDITONAL = 916 ie apne 413000 
a MACRO 21 ’ 9 t D) @a8,R1°1ONoL2, (DI) SERB, (DI) and OFTE 
DP 24H, Mie1ag 7 , : , ‘ EADK 
; BAU ; cur = oprrur : zm, 
H eccnnnnccneneccccccnccascensccesscsccs as a whi 
i SHIPT AMD ROTATE INSTRUCTIONS DB OAH, 21ieH6r2,(D1) 248 @,(DI) AMD OTT 
3 END 
1 Seeedr ee seserress caer asssaseseseuassss i 
i 
; SLA SMLTT LEYT ARITANETIC |. an: 
SIA MACRO 1,CT ent ; 
DB GBE,Ri*1a5¢(CT-1) ‘ 
EnDA °F 


358 


yryae 
Trae 
@esede 


eeee-1 704 

ooez-aszseeze 
Ceee-esteees 
eoer-asteeeee 
weer-aszsceec 
Oa1z-re000006 


0016 °Ser9CSTA 
0010°41290635 
O81 R-a5 E9SSDS 
OOT2-SBESCSEA 
O626-04t8 


0029 +SST BESET 
@82C-41200483 
0638-45 5065 58 
0034-0827 - 
(0636-45 IOI D6 
063a-38E9035C 
O635-0.58 


Ooaereiseccer | 


0048-41400088 
004C-43590508 
eese-e017 
0052-1691 
oesolerr 
0036+47500082 
vesa-ssz0e0308 
OOSi-o41000TF 
0062-1631 
0064°1677 
0066 +47300082 
006a-Ses3e000 
O06E-Sarccesz 
oe72-9p400002 
9076+47500075 
007 1-7480000C 
OC7E-41300508 
0082-58530008 
0086-Se5063IE 
0061-0458 


ee8cesereesre 
0008°41400005 
9094 ese548008 
0e9e-39506408 


eencee75000cz 
9010-53500608 
0044-47500138 
8040+5550060C 
woacea7seezzz 
0038-+45506626 
eederaizecesT 
bés5eesLecsse 
eenses+5aal78 


genacso oece-ouse 
3 
2 SIPPLE PORITOR FOR TEE AMD BISE-SPIED 16-BIT COMPUTER 
3 oeczrsrenecse 
3 te Jim DRIeE 
‘ Ooce-sst00se4 
1 
MACLID MICROZ® eecacoscorsrs 
a 
data: = QU @TTTAN SUSARY DATA PORT ADDAESS oecE-sicoceie 
STaTUS: BOO GY7TRB OSaky COmTROL POAT ADDRESS 
caLy: = 5Q0 GAOTS «= SLIME-FEED, CARRIACK ATTURW eenz-secoedes 
’ 
onc e A ©036+49590118 
3 
Beciu: 3B none CLEAR RO oepacsizeceze 
wa B14,20,DOCALP = 3UEW LINE Om ConSCLE Oonr-sszeesne 
moar: Bal «=s-16, RO, PROMPT = 33/7 PROMPT Oerz-ssB003D8 
Bab «= s«RNG,RO,CETIP «=. SCET USERS 1/7 oors-<st98126 
Bal —«-s«-BUG,RO,SCANER © SDECODE & EXECUTE ConmusD oeraceizeeeze 
ux Be,rouL? SREPEAT LOOP FOREVER Oozs-<sreesDe 
eerz-asz0019¢ 
Vaoret: St 216,30,0PRONPT 3SA0E BET 
3 corerastecezs 
wu 32,°>° 3Pnompr CRARACTIR “> 
oo7a+saceeses 
pal 2i4,R8,CRTOUT sPRCMPT TO CAT 
oerE-oacoeeie 
a 216,20,0PROMPT SRESTORE BIT 
° “s erez-psceeees 
n ae ' 
7 0106-47400002 
DOCRLY: ST Bie.BO,@ROCRLY SSAVE BET oseteesre 
u 32,CaL? CR/LY CODES Olorcosre 
Bak =sRAG.BO,CRTOUT © 0/P 27 
smi SRT eR CODE erates tal 
‘ 013004867 
216,29,CRTOUT 30/7 CR 
» 24,20 ,000CRLF SBESTORE BET i adbathiecad 
si ii ; 011402867 
ae 0110049500355 
Grrips St -BiG.20.0CETIP | ssavk AIT Ou eere Neate 
Lr RD, BUvTTR sa(1 pores) sragcenee 
u MM, cs0en smal 1/? count 0128-Sereesr6 
U7LPs BAL «= s«BA4,RBLCEYCER «© SCEY BEIT 1/P CBARACTEB eiza+se7eeses 
sub 21.8 sPOSITION 1/P CHAR 20 BI SITE @1zz-ai1Decees 
a 25,23 SSAVE HL 3ITE eisz+sec7e00s 
opm 15,RS BTEST BET CODE 0136°1967 
3¢ . «87,28, DOROF st0 EOF 17 RC © IER0 0138+45 500355 
Bak =, BO,CETCER «© SUEXT CEARACTER o13sc-ese? 
al 21, 00778 USAVE ONLY 1/P CHARACTER 3m LC O13E64szeessE 
- om BS,23 SCOMDIGE TvO BITES FoR Osx vORD O162-s1 200028 
omz = 815,215 STE8T BET CODE e1seoaszeesDe 
x 22,38 ,DOROF $70 BOF 17 BC © ZERO 0144°9n700002 
st 25,33.8 3DATA TO 1/7 BUTTER 0145+52D00001 
aa 35,0002 370 BEIT DUTER SLOT 0152°47000132 
st M4 ,00ez scount-2 0156-ser0esr6 
ae 27,ne,dorOr2 «= sSTCP IF mux I/P oisacoczs 
ux a, UPL? CONTINUE CETTISC 1/P 
posorz: LI 25,0000 3ROy AYTER MAX LIBE O1SCoSereesT6 
porors: 8T 25,23,0 SBATA/BOF TO DUTTER 0160036700006 
» 216,R20,0CETIP SRESTORE BET Sresoesponees 
a as : 0160+5867; 
; ©160+2867 
ScanEas ST B16,29,05CABER 3340E RIT eienvieas 
ar 24, SUPER sa(3/P BovrEe) 0170045200106 
w 5.1.0 s6RT CormueD (7ITED FoRnaT) eivavesuebsee 
ov 25,20,D0PCRD «5 DOOR DUP? e17eenee? 
ae 27,20 DUMP 360 IF TRUE er7acis26 
ne 25,20,STROID «5 $-FOR STOBE? 017645120196 
54 27,29 STORE 360 STORE 17 TRUE 9500306 
cnr 25,28,2mPOND «5 J -POR SUnPT 0384-92700062 
ae 27,20 ,J0nP s6O sump 17 TBUE ©106-520008) 
sal 214,R0,D0CRLY tangy Lisk Om Cat 01806°47400166 
u az.°7° 37 €190-soreesre 
Bat sUmteovs COnuaD 0194-0058 
te SBESTOBE RET 
O1s6-o4z00077 


BaPLPs 6ST 
Bab 
uu 


ireuin st 
mn) 
u 
TIPLEPs LD 
aaL 


t 
bocits at 
cr 


4, orers 
214.no.cvaner 
26,errren - 
212,18 

Re .20,p1P4D 
Bis.n08,t1P4D 
eee 
214.20,¢nT00T 
214,20,caTOUT 
216,28; DnPOUT 
az,’ ° sSPace 
214,28, CR TOUT 


uZ7,RODAPLP 
224,28 ,OSCARER 
aie 


216,28 ,0T7PAD 
bo 


B16 20 ,ODMPOUT 
‘37,28, D67aD 


213.0001 


22,26 

216,20, D0CIT 
214,20, CATOUT 
36.8 

a2.26 . 


M27 RO. TTPLLE 
B14,RO,CTIPLIT 
aud 


32,00778 
az.° ° 


SAC ABDRESS PORTION OF BUTTER) 
AASCIY ADDRESS TO Binasy im 26 
7 bSeSIm Os EVEB wCRD pOUNDAT 


s0/P LIGE Coust 
SSaUE CURREDT O/P acpaEss 


STIPE CURREST CONTENTS OF 26 


370 cat 
s2 spaces 


SPUT OUT OME Link OF DunP DATA 


370 cat 

10/P LITERAL DATA 

InBy Line ow cat 
SCURREAT DUFPOTT ADrarss 
SADDUESS OEIT LIGE 
shiek cous? -1 

sL00P TERU O/P Date 
RESTORE RET 

, 


ssaun att 
tur apberss BITE 
torr 

320 ADDAESS BITE 
so/P 

_FRSTORE BET 

; ° 


3SA0E att 
“Txt O/P DATA 
10/P worD court 
GET EIY voRD 
tEL BITE FIRST 
310/P 

$10 BITE 

torr 


sro car 

sBCnP 1/P DATA ADIAESS 
sVORD COURT 2 

SLOOP THRU LINE 
SRESTORE RET 

t 


sSavE arr 

3GET O/P DATA ADDRESS 
swoRD Count 

3UEIT O/R vORD 

sul DITE FIRST 

330 O/P REG 


TCMECE FOR PRIWTADLE CBARACTER 


30 CRT 
SGET LO BITE 
330 O/P AES 


SCEECE JOR PRINTABLE CHARACTIR 


370 CRT 

STO WEIT vORD 

S¥CRD COURT -3 
SLOCP TERU O/P LIBRE 
SRESTORE SET 

3 


sGgt Low BITE 
SDELOW BLANK? 


Cleaceszeceze 
Clor-erseelaa 
O1arogszeee7r 
M1asea7sreeee 
Ciascqizecezs 
O1as-octe 


O1p0-41400610 
O1decerzeeses 
O136+30400406 
CiacorvDD 

O1ngosszeeiEs 
@iCz-sezeesze 
eiceepssecoer 
Orcacarszeees 
OiCR-aszeei7E 
@lbzoseLeeSTO 
OiDe-ssseceaD 
O1barc7szecee 
O1d1-pacoeeez 
0152-76000135 


O1B6-Sezsesra 
O25 sessecee 
0155 -ees7 
Gire-aszsezie 
atrecsereesra 
erte-oare 


O1racsexaesra 
O17 E+s63ee006 
S2ez-94seenT7 
e20Coaszeez16 
Szsaeseressra 
Reetemre 


Gz18-seressrc 
O21 Gees tvEz3C 
G2ie+sezecsrc 
C21 CeeTS E9008 
G2z0-eSreezSa 
0224¢¢7300232 
Szze+aszse200 
G22Cesersesrc 
8230-0458 

SZ32°41S50000D 
8236*Saree37C 
8234-0480 


023¢-93500029 
6240°47S E0000 
O244-0sS0062E 
0248+ 73 E0000 
@2ecogsseeezc 
G2Sece7sz0000 
Ozs4-ssseceeD 
C230-e450 


e2SacpasooerT 
G25r-95500036 
0262-475 E9000 
0266-psse0634 
626447508268 
C26r+555900061 
6272047930068 
0276-99500847 
0274047308264 
Oz7E-9sserrIT 
0202-0458 


SETPER: 


iy 
STORE: 


STLPs 


s 
UPsTOa: 


‘ 
LosToas: 


t 
‘StDara: 


3 
CEDELs 


, 
ASCEEEs 


VALPES 


USEY Pesion IF TRUE 


LT? 80, SETPER 
m2 ,00778 SaSlow BEL? 
LT7 324.8 sary 17 tevz (cmaR Patavanee) 
a2,°.° SSET PERIOD AS CHARACTER TO Pat 
Pity ' 
Be, BUTOPS sa(aparss FICiD) 
B16 .BO.CUAIOR SASCIS ADDRESS TO BINARY (Im 86 
84,28 DATED SCET CURBEWT I/P Data adress 
Bi3.R33 SCLEAR GIBBLE Covet RES 

IUPPER STE FIRST 

sGn? Bar 

sup? (ca © Eup) 
27,814.80 wart LF TRUE 
R1G.26,LOSTOR SLOWER BITE 

sGET agT 

sup? 

tasT LP TRUE 

370 agzt voRD 


SCONTINUE STORIES Bata 


3SAVE BET 

SCEY BEIT DATA 
tCEr at Brvs 
330 STORE BITE 
SRESTORE RIT 
aie Uy 


85.8 
B14 ,BO,STDaTA 


sSave Orr 
iGBT pata 


B9,00778 JEEEP Low BITE 
Q14.ne,stoara © 330 StOar BITE 
B14.R6,0UPSTCR RESTORE BET" 

a1 ' - 
B14,R8,OSTDATA SAVE BET 

B14,R0,CECEL © SCRECE FOR CELINITIR 
B1G,20,@sTDATA SGT BET 

27,214.0 warr 17 ac © 8 
216,20,ASCBEX SASCII BITE TO REZ HIDILE 
L77,29,SETHD = -3.Z. BC © TED 


IBBLE 0 FSTORE TRIS MIBBLE 


sOSTDATA SRESTORE RET 

’ 

85, 00008 svack OF 

216,20,@STDATA = SRESTORE RET 

aus H] . 

as,°° ' ssPace? 

37,216,8 sant 17 TRUE 
spERiop? . 

27,214,8 . sant iy TRUE 

as.°.° scour 

27,234,8 saBT SY TRUE 

BS ,c0eDE sCamgiask RIT? 

ae SREY CALLER DECIDE 

aS ,oorTa tLOw BITE OaLY 

as,°e° SLOWER Team “O° 7 

Lt?, 214,80 sary IF TRUE 

B9,°3° s0-0 7 

Ut?,Ro.vauM SmunEAical 17 TRUE 

as, ‘a’ shovrn Teak “A° ? 

1t7,210,0 sary ty Tak 

29,0067 SEER APRA? 

LT7,RO,VALPE «= SBEE ACPEA LY TBUE 

as, orrrrn s55T Lt. ce 

are s 

29 ,.coeTs 3ASCIE aDJUST 


o284e93300087 
Ozas-saseeeeT 
828C+1358 
G2at+sare 


e29¢+se7Ee008 
o27d4-1603 
er06+4740074C 
ezsa-sacesses 
0201-603) 
Ozae+st70erry 
e240-1673 
O2aGose7Es008 
Ozab-O6ts 


O2aC-sSpeses) 
ezRO-a7e002Ce 
0204 -aDeceel 
6238-6997 
eziacpevarer? 
O23%°36735 
e2ca+sa7eeaas 
O2ceoears 


exe -sspeenez 


O2CAcereeezre 
e2zck-gabeeeal 
6232-8089 
O234°94 7877607 
6208-3673 
Orpacseveuees 
G2nt-eare 


O2IH+1 733 
@222¢S47RFFI6 
e2t8-1673 
G2zacse7esasa 
e23c-s1ceeee2 
ezre-eare 


ezrzeaiceeaie 
e2re-an teased 
ezyaciare 

O2zrceasrreces 
asas-74800008 


e364 -Sareece2 
as90°1766 
Gseacsasaases 
eser-aes? 
0310-49206252 
0324047400258 
0310°1665 
@31sesasseees 
@31toesnoezsn 
@522+474003598 
es26-0003 
03281663 
eSza-gacoeaer 
eszresasasece 
@332-685? 
334-4 rsezsa 
o338-47400394 
@33c-963 
@25t- 1663 
asce-sas4eaee 
ed4erqsreezsa 
0348947408396 
ESL +63 


' . 

wIsetes ie 
oon 
x 
ar 
str 
Lo 


bratpis cr 
* 
aa 
sun 
7 
om 
sv 
u 


: 
ersiez: Cl 


BS ,c00r2 
25.85 


2726.8 
313,813 

Ott 80, 050193 
B13 .0001 
B12 

a7 .ervT8 
27,83 

37,36.0 

ae 


333 ,0061 

O27 20 UIA Zz 
B33 ,00¢3 

29.8 
87,0ra7TE 
ns 

87,36.8 

rt) 


213 0082 
GI7,Be BIBLES 


419,813 
a7 ertTes 
W715 
a7, 
2e,0002 
ne 


Be, buvcrs 

Big .26,CVADSR 
213,26 

Bie, 223.0 

a6, BEsi6 


B14 22 ,0CA5528 
36.26 

85 ,84.0 

23.6 

816,26, 6SCEEE 


“nxt .aecuacat 


26,R5 

35,246.80 

R146 BO ASCE 
227 20 .CusCoT 


83 


20,0082 

BS ,84.8 

n.8 

224 ,BO,ASCEEE * 
m37,RO,CVROTS 
36,4 


shew wtastz cetr 
tac © 3830 
3 


3SET OLD Bats 

sBi3 ¢ mR0 7 

sTEST PCR COE IF SOT TIUE 

SBORP GIDBLE CCoRT ER 

sPOSITICN THIS BIBLE 

SPREPARE CLI DATs PCR BEY BISEL 
SEQSERT BEV BIDILE 

iBATS BACK TO WErORY 

t 


3USIT BIBBLET 

$30 GEST ly act THIS 

tDCrF BIBBLE COURTER 

SPOSITICS TELS BIIPLE 

SPOEPARE CLE Caza FOR BEw wlbdL 
SUOSERT ASu SISDLE 

“38072 BACE TC MLPORT 

t 


MEST BIBBLE? 

320 BEET IF act Tals 

SESMP SIDBIE Covet 

sPOSITICS THIS BIBILE 

SPREPARE OLD BATA PCR stv SIBDL 
SEuSTRT By BIBILE- 

338TA BACK TO ADCRT 

s 


3EaST GIBBLE (Lsa) 

SPREPARE CLD ats POR ery ataat 
SBuSEaT Sky BIPaLE 

3BaTa RACE TO MOWRY 

sBONP AEN POLATIR 

$ 


saCaaoerss) 

SOSCIE ADCAESS TO DIARY ABtOcS 
sapeasss To BS 

3JUMP... 

sBact TO mcaitce IF CALLEE RITE 


SAVE BET 
sCLEAR 26 

ORT TvO ADDRESS BITES 

suPrER WITS VIBST 

3aSCI3 AYTE TO BEX BIBLE 

ister 17 sor ex care 

SPIRST ADEBESS BIDALE To BE 
3GET ADLANSS BITES Acale 

ASCII BTTE TO BEX MIDBLE 

aster 17 aor BEX BATA 

IPOSITICR ADERESS FOR BEXT 122 
SIMSERT BEIT ADCRISS BIDDLE 
SRUNP AERCRT PTE TO BET utOD 
TUERT ASCE ADERESS Dats 

IRICE DITE FiasT 

ZASCIL BUTE TO BEX BIDILE 
stor IF ect UE para 
SRCSITIOm ADERISS FOR sEST atte 
SURSERT ASIT RIBBLE 

SCNT ADIRESS Para acaze 

SASCID BTTE TC REX Blake 
sSTCP IF eet as Bats 

SPOSITICN AVERESS FSR GRIT W199 
S2USERT DEAT RIDDLE 


359 


360 


@yteices 
j338-oaceneez 
O334°30000406 
@558-SeLveCet 
essere 


O3aE-9BreOSTE 
G362°1026 

364-6873 

eSee-se2zeeeny 
O2Ea-4 SK EIEO 
@3er-ast0a3ce 
@3°2°1626 

Or teckesonre? 
O3erasreesee 
@SICHASENOIEO 
O360-582¢0376 
Odte+eerte a 


Slot -ssseeres 
036247788302 
@30T-Pa2zeeee? 
Gl92-£ 4200038 
@30t-+eers 


0506 +30T E0408 
@50C-aeerrys 
O24 0-642 900082 
S3R0eTS2099C 
PUTT Hah 
e2acevsiveer? 
@3acoresy 

GlB2-as keeles 
e3at+leiz 

OSte-38raN ese 
alecearerniy 


crscets 
Cenotar 


‘ 
BieoUTs 


i 
SBTCER: 
BDCER: 


a 
st 
a 
ty 


4,eee2 

Ra, RO,batad 
A246 ,RO,CCTABER 
au 


Did Be, Osisour 
B2,06 

a2,4 

32 ,020Tn 
Bia Re UETEZ 
B14 ,8e,CatoUT 
42,36 


BLD, STATUS 
B1,ose2 
27,28, RDCmR 
22,30,Date 
23,00778 
az.Ra 


Lty 
B1.az 

Bid, 28, 251TCER 
Bi5.-3 

Az, eeean 


oC2TCUT 


10 DEIT mERcRT vous 
SAVE aS Data Adzarss 
iaEstoar ast 

: 


TSAvE Br 
20? BITE TO Bz 

TUPPER ALDDLA TIAST 

IEEEP CALY GOOD oaTA 

SBUWARY NIDILE TO ASCEL BITE 
RUBBLE (BITE) OUT TC CAT 
30s? Bata TO a2 

SHEEP ORLY LOW BIDDLE 
3DUMART BIBBLE TO ASCLY DTTE 
tand out TO cat, 

saRSTCRE 25 * 
Hy 


tar? 

332 IF wot TRUE 
sADJUST FCA A-7 
SMAEE ASCII 

‘ 


3SAE BET 
SSTRIP PARITY 

31/? BEADT? 

LOOP UnTIL CHARACTER B4ADT 
3aBaD DATA 4 
SESEP OLY Dara OTTE ' 
DATA TO AZ 

3ECEO 1/P 

BATA DACE TO RI 

sCRT BET 

SET AIS oz. 

SLY COPE Im CASE OF CR 


O3Ceraizeeras “ 
e 
Oscerssieveat © sé 
ercb-e7ezecee Coes 
sy Bab 
esccoasreoste 
i ma. 
esceci777 
to 
@3D2+s0520<08 + 
eSve-eare m4 
i 
catours Im) 
oSre-aeierrys 
ar 
OdDC-Sarecees 
5 T 
Odte+a7S00508 
our 
@stecazzeryra 
a. 
e3re-esre 


1 

@PaCrr? CS 
WDOCRLY DS 
OCKIIP 
OSCanmA BS 
@rrrad DS 
OLRPocUT ES 
OTTALIT DS 
OSTORE BS 
OLPSTO BS 
OSTDATA DS 
ORIACUT OS 
OCKTCAA LS 
Ocvadra cs 


1 
CmpaDs DS 
carad: BS 


Enpcnd: 3 
STACHD: DB 
anvcno: Dp 


4620 
2328 
4120 


week TEA: DE 
ale BUrORss os 
acne 


@254 ASCEEr 
@23C CEDEL 


OSES vLocaLy 
"onc OSTDATA 


0400 OCEICAR 
O378 OSTORE 


23,20, STATUS 
Rioeen 
27,20 ,carour 
B2,28,DATA 
ane 


NN NNN NNN NNN 


O3SE BInOUT 
@,0D CBALY 
Iva Data 
@iS2 TMPLPP 
@o7K SoLor2 
Oo4c IPL? 
2250 mI2OLE 
Bs9c ADCHA 
@218 STDATA 
@13C TIPLIT 
GSTE vaInoUT 
OSEE OCLTIP 
OSy2 OTTPAD 


sparta © ca? 
3013 IF a0 

300 bP IF PAEVOIUS VAS CR 
SET AC © 2ER0 FOR Ch’ 
SMESTORL ALTURY 

’ 


SGRT STATUS BITE 
SEMITTER EMPTIT 
swalt FOR IMLTTES YO Erety 


sO/P DATA 20 CRT . 

3 

Osek SUTTER Sele seyors 
O30G CRIOLT O3e4 CrAccE 
sce DaTAD Gace Peyad 
@i26 CmPpouT giss Weir 


O350 SITCED 
e272 Ju") 
O2CE wIDIsz 


@Gc2 OURP 
@40C JMPCRD 
G2aC MIMIBL 


Ce@Oc SCAnER @232 SEtad 
@12E STLP olde szcie 
O1¢e TYPLLP OLEH UPsitce 


@374 DoRPCVT 
OSye Occaate 
@37a UPTON 


€402 VCTALER 
@SEA GPRONFT 
8376 OTTPLIT 


APPENDIX D : ! are eh A disk and most user generated programs (source programs, 
user libraries, special assemblers, etc.) are on the B disk. In the 
following session, lower case letters are what the user typed-in, 
upper case letters are what System 29 responded, and com- 
ments (added as a tutorial) are in curly brackels. 


Time System 29 operating system manages two floppy disk drives, 
AkandB. The system will prompt with a A> or B> depending upon 
which disk the operator selects as the default. Generally, most 
Swystem programs (editors, debuggers, compilers, etc.) are onthe 


j 


A>ed b: amd16bit.asm _ {call the editor to edit AMD16BIT.ASM from the B disk} 

, u 5 {any program addhtions, changes, and/or deletions go here} 

*e ae {exit the editor and save the new AMD16BIT-ASM on the B disk} 
A>b: sais Oak ” {switch to the B disk as default} 


B>mac29 amd16bit Sab hb pb sb {use the modified macro assembler (MAC29) to assemble AMD16BIT.ASM and put the 
HEX, PRINT and SYMBOL files back on to the B disk} 


ASM29 VER..1.0 


0490 

03BH USE FACTOR 

END OF ASSEMBLY z cee 

B>a: o— - {switch back to the A disk} 

A>ddt29 he ‘ {run DDT29, Halt the 16-bit computer's clock and Exit DDT29} 

A>set pa 3d os {set the page register bit to enable the 16-bit computer main memory as 9080 upper 32k} 
A>ddt {toad 9080 DDT} 

ODT VERS 1.4 

#1b:amd16bit.hex {reference the simple monitor's HEX file on the B disk} 

#18000 i {read AMD16BIT.HEX into 9080 memory beginning at location 8000 HEX (upper 32k)} 
NEXT PC END 

840E 0100 577F 

#tc {exit DDT via contro-C} 

A>Ibpm m29 wes cl ul de 1 {toad the 16-bit computer's microcode (phase-1 instruction set)} 


LOADING: M29.OBJ 

TITLE: MICROPROGRAM FOR 16-BIT COMPUTER 
VERIFYING: M29.OBJ 

TITLE: MICROPROGRAM FOR 16-8IT COMPUTER 
VERIFY COMPLETE 


A>ddt29 ir 0 jr {run DDT29, set the instruction address register to zero (IR 0). jam the address on to the 
microprogram address bus (J), and run the 16-bit computer's clock (R)} 


set to 9600 baud and connected to console USART can now 


At this point, the AMD 16-bit high speed computer is running : 
exercise the simple monitor. 


phase 1 instruction set in microcode and the simple monitor in 
target machine language in 16-bit main memory. A CRT terminal 
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Memory Board 


The 16-Bit Computer Main Memory board was organized with 8k 
by 16-bit RAM section and a 2k by 16-bit ROM section. The RAM 
section occupies address 0-8k while the ROMs are assigned 
addresses 8k through 10k. The memory word consists of two 
bytes. The least significant address line specified whether high or 
low byte but is not used in the word mode. The address value from 
the computer is captured in a register at the beginning of the 
cycle; however, the most significant address lines are routed 
Straight from the bus to the clock decode logic to make an early 
decision as to whether the memory board has been selected. 


In the word mode, the read and wnite transters are straight for- 
ward. For the byte read mode, data is output on bus bits BDp.7 
while BDg,.15 are forced to zero. During byte write modé bus bits 
BDpo.7 are duplicated intemally on lines Do.7 and lines Dg.45. The 


signals WRHIGH or WRLOW select which byte in the RAM 


memory is etfected. 


The control logic generates the bus control line sequencing re- 
quired by the 16-Bit Comouter. The memory read and wnite timing 
is shown in Figures E1 and E2. The bus controller function is 
simulated for the purposes of the prototype. Bus Request is 
Glocked into a flip-flop and Bus Acknowledge is retumed to the 


computer. The Memory Request signal from the computer ini- 
tiates a memory cycle. Fifty nanoseconds later the memory board 
responds with Address Accept. The computer then follows this 
with Data Request. The memory board responds with Data Syne 
and 50 nanoseconds later the data read out of the memory is 
clocked into the output registers and output on the data bus. 
Looking at the memory read timing diagram, it is seen that aread 
cycle is initiated with Memory Request but the data is not sent 
back to the computer until the beginning of the next microcycle. 


The write cycle is extended one oscillator cycle. This is necessary 
with the Am9124 RAMs because the data are not sent to the 
memory board until Data Request goes active (see Figure E2), 
which is 100 nanoseconds into the write cycle. With the clocked 
handshaked memory protocol of the 16-Bit Computer, this is 
easily done by delaying Data Sync one oscillator cycle. Since 
normally a computer performs many more read than wnies, this 
impacts throughput only slightly. 


Additional logic was appended to allow the memory to be ac- 
cessed by the System 29 microprogramming development sys- 
tem. The Map Page (MAPP) of System 29 was usedto specity the 
themory. The logic interfaces the control signals required by 
System 29 and the 16-Bit Computer Memory board. With this 
logic, the System 29 user can readily read or write into the 
memory. ‘ ws 
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Figure E1. Memory Read Timing. 
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Figure E2. Memory Write Timing. 
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AC performance, worst case, 106 
AC propagation delay paths, longest, 33 
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microcode, 166-167 
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ADD IMMEDIATE microword, 123, 124 
ADD instruction: 
description of, 349 
steps for, 98 
ADD microword, 123, 124 
ADD RAI microword, 123, 125 
ADD WITH CARRY instruction description, 349 
Adder: 
carry lookahead, four-bit, 94-95 
full (see Full adder) 
Address, effective, 7 ; 
Address-based microinstruction cycle, 35 
Address compare mode: 
Am2940, 242 
Am2942, 251 
Address counter: 
Am2940, 241 
Am2942, 250 
Address-data-based microinstruction cycle, 35, 37 
Address-instruction-based microinstruction cycle, 
35,36 
Address-instruction-data-based microinstruction 
cycle, 35-37 
Address line control, 240 
Address maintenance, 240 
Address multiplexer, 15 
Address output buffers, Am2940, 242 
Address register: 
Am2940, 242 
Am2942, 250 
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Addressing, 7 
direct, 7,265 
immediate, 7,265 
indexed, 264, 281 
relative, 4-5 
stack, 265 
Addressing modes, HEX-29, 264-265 
ALU (see Arithmetic & Logic Unit) 
Am25LS168 decade counter, 47 
Am25LS2521, 306 
Am25LS377 eight-bit registers, 39 
Am26S02, 306 
Am27S29 512 x 8 fuse-link PROM, 288-290 
Am2900 family, 7, 61 
Am2900 Learning and Evaluation Kit, 260 
Am2900 system, Am2914 in, 215-220, 229-231 
Am2901A arithmetic logic unit/function generator, 
93, 98-101 
Am2903 compared with, 106, 113 
architecture, 99-101 
arithmetic operation speed computations, 
108 , 
logic operation with shift speed computations, 
109 
logic operation speed computations, 108 © 
magnitude only arithmetic operation with shift 
down speed computations, 110 
microinstruction control, 101 
as program control unit, 203, 204 
propagation delays, combinational, 126 
set-up and hold times, 126 
switching characteristics of, 125, 126 
timing analysis summary, 144 
two's complement arithmetic operation with. 
shift down speed computations, 109 
Am2901B ALU/register sets, 290-298 
input bus, 298, 300, 301 
output bus, 298, 299 


387 


388 


Am2902 carry lookahead generator, 214-215 
logic diagram, 216 
Am2902A carry lookahead generator, 93 -96, 201 
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ALU destination control, 180 
Am2901A compared with, 106, 113 
architecture, 101-104 
ALU, 102 
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ALU functions, 103, 180 
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instruction decoder, 104 
output buffers, 104 
Q Register, 103-104 
special functions, 103, 180 
two-port RAM, 102 
arithmetic operation with 16-bit speed computa- 
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block diagram, 102 
data bus cascading, 106 
example, 220, 221 
general description, 101 
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- 111 
logic operation speed computations, 110 
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down speed computations, 112 
microprogram control bits, 115, 122 
microroutines with, 146-159 
microword ADD, 123, 124 
microword ADD DIRECT, 123-125 
microword ADD IMMEDIATE, 123, 124 
icroword ADD RAI, 123, 125 
Ricroword FETCH, 122-124 
icroword INIT, 122, 123 
mnemonics, 180 
programming, 104 
propagation delays. combinational, 127 
RAM address cascading, 106 
registers, expanding the number of, 105-106 
sample microroutines, 122-125 
set-up and hid times. 127 
16-bit de~.gn, 113-125 
switching characteristics of, 125, 127 
tim:ag analysis summary. 144 
two's complement arithmetic operation with shift 
down and 16-bit speed computations, 112 
Am29064 status and shift control unit, 131 
AMDASM Phase 1 and Phase 2 listings of micro- 
programs, 168-179 
arithmetic operation ar.d 16-bit speed computa- 
tions, 141 
arithmetic operation speed computations. 138 
arithmetic operation two's comptement with shift 
down and 16-bit speed computations, 142 
assumed set-up time, 135 
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BCD add, 157, 159, 166-167 

BCD hardware additions, 154-155 

BCD to binary conversion, 155-156, 164-165 

Binary to BCD conversion, 156-158, 164-165 

bit operations, 132 

block diagram, 131 

carry-in contro! multiplexer instruction codes, 134 

carry-in multiplexer, 132 

condition code multiplexer, 132 

condition code output instruction codes, 134 

CPU hardware diagram, 182-187 

criteria for comparing two numbers following “A 

_ minus B" operations, 134 

double-length normalize command, 146-149, 
162-163 

load operations, 132 

logic operation with shift seoed computations, 
139, 141 

- logic operation speed computations, 138, 140 

machine register condition code output, 181 A 

machine status register, 131) “Uo oui. : 

machine status register instruction codes, 132, 
181 Inlunia 

magnitude only arithmetic operation with shift 
down speed computations, 140, 142 

microprogram structure, 144-145 

microregister condition code output, sk 

microstatus register, 131 f 

microstatus register instruction codes, 182. 181 

mnemonics, 181 

non-restoring binary roots, 153-154, 162- 163 

normalization microroutine, 146-147 

pipelined microprogram bits, 144-146 

preliminary switching characteristics, 135 .. 

register operations, 132 

sample microroutines, 146-159 

shift linkage multiplexer instruction codes, 133 

shift linkage multiplexers, 131~132 

16-bit. 135, 144 

standard device Schottky speeds, 135 - 

status registers, 131 

timing analysis, 135 ag 

two's complement arithmetic operation with shift 
down speed computations, 139 

two's complement division, 150-153, 160-161 

two's compiement multiplication, 150, 160-161 

unsigned multiply, 147, 149, 160-161 

Y output Instruction codes, 135 
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computer control unit using, 42-45 


Am2910 microprogram sequencer, 9. 17, 229, 231, 
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architecture. 17-18 
block diagram, 2231 
calculatious on, 24-27 
complete CCU using. 43, 48-49 


Am2910 microprogram sequencer (Cont.): 
computer control unit architecture using, 22-38 
initializing, 38 
instruction set, 19-22 

table, 18 
Am2911 microprogram sequencer, 15-17, 220, 222 
in computer control unit, 39-42 
propagation delay calculations for, 24, 28-31 
Am2914 Vectored Priority Interrupt centroller, 
211-235 
in Am2900 system, 215-220, 229-231 
block diagram, 212-214 
cascading, 214-215 
logic symbol, 212 
microinstruction set, 212-213 
programming, 212-213 
registers, 213-214 
time delay using, 220, 224-229 ~ 
Am2918 four-bit register, 10, 42,47 
Am2922 eight-input multiplexer. 24-32, 47 
Am2930 program centrol unit, 201-204 
block diagram, 201 
parallel look-ahead expansion scheme for, 202 
ripple expansion scheme for, 202 
Am2940 DMA address generator, 241-249 
architecture, 241-247 
control modes, 242-243 
example design, 247-249 
general description, 241 
instructions, 243 
timing. 243-247 
Am2942 Programmable Timer/Counter, DMA Ad- 
dress Generator, 249-256 
architecture, 249-251 
block diagram, 250 
control modes, 25% 
example designs, 252, 254 -256 
function table, 253 
general description, 249 
instructions, 251-253 
Am29705 16-word by 4-bit two-port RAM, 105-106 
‘ block diagram, 105 
Am29761 265-word 4-bit PROM. 39 
Am29803A 16-way branch control unit, 43 
computer control unit using, 42-45 
function table, 46 
Am29811A next address control unit, 15-17 
in computer control unit, 39-42 
instruction set, 16 
instruction set difference from Am2910 instruc- 
tion set, 22 ; 
propagation delay calculations for, 24, 28-31 
Am74S138, 306 
Am74S158 Two-input Multiplexer, 42 
Am74S175,42 _ 
Am74S181 four-bit arithmetic logic unit/function 
generator, 96, 97 
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Am74S240, 306 
Am74S373, 306 ; 
Am74S374 registers, 290, 291,301,305 
Am9080A type microprocessor, 10, 239 
CRT controller and, 62 
AMD 16-Bit Computer design (see Super Sixteen) 
AMD System/23 (see System/29) 
AMDASM, 10 
definition file, 330, 336-337 
and assembly file, 79-87 
Phase 1 and Phase 2 listings of Am2904 micropro- 
grams, 168-179 
AND BYTE instruction description, 351 
AND instruction description, 349 
Applications, microprogram state machine, 
33-34 
Architecture: 
Am2901A, 99-101 
Am2903 (see Am2903, architecture) 
Am2940, 241-247 
Am2942, 249-251 
computer, 2-10 
microcomputer, block diagram of, 10 
microprogrammegd, 10 
possible state machines, 34-37 
figure, 35-37 
subroutine stack, 200 
Super Sixteen CPU, 321-330 
logic diagrams. 365-384 
Arithmetic & Logic Unit (ALU), 3-6, 93 
Am2903, 102 
destination control, Am2903, 180 
destination control summary, Am2903, 104 
functions, Am2903, 180 
multi-register, 98 
shifter, 102-103 
Super Sixteen, 321-323, 366-367 
Arithmetic & Logic Unit/accumulator configuration. 
96-99 
Arithmetic & Logic Unit/register sets, Am2901B, 
290-298" 
Arithmetic & Logic Unit/three register machine, op- 
eration of, 98-99 
Arithmetic operation with 16-bit speed computa- 
tions: 
Am2903, 111 
Am2904, 141 
Arithmetic operation speed computations: 
Am2901A, 108 
Am2904, 138 
Arithmetic operation two's complement with shift 
down and 16-bit speed computations, Am2904, 
142 
Arithmetic operations. 3 
HEX-29, 266-267 
Arithmetic Processor Unit. 6 
Asynchronous events, 207 
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BCD add, 157, 159 
microcode, 166-167 
BCD conversion, binary to, 156-158 
microcode, 164-165 
BCD hardware additions, Am2904, 154-155 
BCDB to binary conversion, 155-156 
microcode, 164-165 , 
Binary conversion, BCD to, 155-156 
microcode, 164-165 . 
Binary roots, non-restoring, 153-154 
microcode, 162-163 
Binary to BCD conversion, Am2904, 156-158 
microcode, 164-165 
Bipolar microprogram controller, 17 
Bit operations, Am2904, 132 
” Bit stice timing, 106-113 
Block diagram: 
Am2903, 102 
Am2904, 131 
Am2910, 231 
Am2914, 212-214 
Am2930, 201 
Am2942, 250 
Am29705, 105 
DMA peripheral controller, 248 
HEX-29, 278, 280, 282-285 
microcomputer architecture, 10 
Branch and link flow chart, 339, 341 
BRANCH AND LINK instruction description, 355 
Branch and stack instruction, 200 
Branch code, HEX-29 microprogram sequencer, 288 
BRANCH instruction, 14 
BRANCH instruction description, 355 
BRANCH ON CONDITION instruction description, 
355 
BRANCH ON INDEX instruction description, 355 
BRANCH operation, 4 
Branches, conditional, 261-262 
Bulk Memory. 6 
BYTE SWAP instruction description, 351 
Bytes, & bit, 191 


Call executive instruction, 275 

CALL instruction description, 352 

Call interrupt service routine microprogram, 220, 
223 

Carry flag, 263 

Carry generate. 94 

Carry-in, 94 

Carry-in control multiplexer instruction codes, 134 

Carry-in multiplexer, Am2904, 132 

Carry lookahead adder, four-bit. 94-95 

Carry lookahead generator. Am2902A, 93-96, 201 

Carry method usages, ripple, 95 

Carry output flag, 99 


Carry propagate, 94 : 
Cascading the Am2914, 214-215 
CCU (see Computer Control Unit) 
Central processing unit architecture, Super Sixteen, 
321-330 
logic diagrams, 365-384 
Central Processor Unit (CPU), 3-7 
hardware diagram, Am2904, 182-187 
HEX-29 (see HEX-29 CPU) 
with internal high speed registers, 3, 4 
read timing, Super Sixteen, 325 
Centralized DAM, 240 : 
Channels, 275-276 
Clock, system: 
Am2940, 242 
-Am2942, 251 
8-phase, 8 
HEX-29, 278, 281, 286 
Clock controj, Super Sixteen, 322, 324, 374-375 
Clock pulse (CP), 262 
numbered, 66 
COMPARE instruction description, 350 


* COMPARE LOGICAL BYTE instruction description, 


351 
COMPARE LOGICAL instruction description, 350 
COMPARE LONG instruction description, 353 
Computer, stored-program, 3-7 
Computer architecture, 2-10 
(See also Architecture) 
Computer basics, 3-5 
Computer control flow diagram, 34 
Computer control function flow diagram, 9 
Computer Control Unit (CCU), 5-10 
architecture, 15-17 
using Am2909 and Am29803A, 42-45 
using Am2910, 22-38, 43, 48-49 
using Am2911 and Am29811A, 39 
using Super Sixteen, 322 
set-up for high-speed micro-level interrupt han- 
dling, 232 
timing, 23-33 
Computer data path, three register, 97 


* Computer design, AMD 16-bit (see Super Sixteen) 


Condition code input, 202 
Condition code multiplexer, 15. 
Am2904, 132 
Condition code output instruction codes, Am2904, 
134 
Condition code register, HEX-29, 263-264 
Condition select multiplexer, 261 
Conditional branch flow chart, 339, 340 
Conditional branches, 261-262 
CONDITIONAL JUMP PIPELINE instruction, 19, 
21-22 
figure, 20 
CONDITIONAL JUMP REGISTER/COUNTER or 
PIPELINE instruction, 21 


CONDITIONAL JUMP REGISTER/COUNTER or 
PIPELINE instruction (Cont.): 
figure, 20 


Conditional jump speed computations, 24, 25, 28, 29 


CONDITIONAL JUMP-TO-SUBROUTINE instruc- 
tion, 19, 21 
figure, 20 

CONDITIONAL JUMP VECTOR instruction, 21 
figure, 20 

Conditional jumping, 13~15 

Conditional operation, 4 

Conditional push speed computations, 32 

CONTINUE instruction, 22 
figure, 20 . 

CONTINUE (CONT) statement, 13 

Control! modes, Am2940, 242-243 

Control register: 

Am2940, 241 
Am2942, 250 

Control store, 260 

Counter register, microprogram, 15 

CP (clock pulse), 66, 262 

CPU (see Central Processor Unit) 

CRT controller: Eig? 
AMDASM definition and assembly files, 79-87 
complete wiring diagram for, 50, 52-53 ° 
design of, 47, 50-61 
display formats accommodated on, 78-87 
logic diagram, 47, 51 

of interface circuit for, 62, 63 
microprogram for, 50, 54-56 
principle of operation, 47, 50 
software emulation, 66-67 
of System 29 universal card, 88 
timing considerations, 50, 57-61 
wiring diagrams, 62, 64-65 

Cycle steal method, 240 


-Dbus,97 .- 
Dinput, 101 
Data-Based microinstruction cycle, 35, 36 
Data bus cascading, Am2903, 106 
Data formats, 191 
Data movement, HEX-29 mnemonics, 267 
Data movement.capabilities, 7 
Data multiplexer: 
“Am2940, 242 

Am2942, 251 
Data path, 92-187 é 
’ Super Sixteen, 324, 370-371 

three register computer, 97 
Data routing, 261-262 
Data transfer contro!, 240 
Dead page, 274 
DECIMAL ADD instruction description, 353 
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DECIMAL SUBTRACT instruction description, 353 
DECREMENT INDEXES instruction description, 
353 
Defined register class of instructions, 265 
Definition file, AMDASM, 330, 336-337 
Delay, time, using Am2914, 220, 224-229 
Delay path, longest signal, 23-24- 
Delays, 6 
Depth, memory, 5 
Depth-over-width (d/w) ratio of memory, 10 
Design, microprogrammed, 12-61 
Direct addressing, 7, 265° 
Direct memory access (DMA), 6, 238-256 
centralized, 240 
control, HFX-29, 301,305 
distributed, 240 
implementation, 240 
1/0 system, 240 
peripheral controller block diagram, 248 
repetitive, 240 
Direct memory access (DMA) Address Generator, 
Am2940, 241-249 
Direct memory access (DMA) Controller, 240 
Disk drive management, System 29 operating sys- 
tem, 361 
Distributed DMA, 240 
Dividend in divide operations, 151 
Division, two’s complement, 150-153 
microcode, 160-161 
Divisor in divide operations, 151 
DMA (see Direct memory access) 
Doubte-length normalize command, 146-149 
microcode, 162-163 
Double words, 32-bit, 191 
d/w (depth-over-width) ratio of memory, 10 


Effective address, 7 

Emulation, software, of CRT controller, 67-77 

Enable control, extended, 42 

Enable stack signal (FILE ENABLE), 15 

Engineering model Super Sixteen, 346-348 

EXCHANGE BYTE instruction description, 351 

EXCHANGE PROGRAM STATUS instruction de- 
scription, 351 

EXECUTE instruction description, 353 

Execution of microinstructions, 13 

Executive interrupts, 207 

Exponent, signed, 191 

Extended enable control, 42 


F bus, 97 
FETCH, overlapping or pipelining. 14-15 
FETCH instruction, 4 
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FETCH microword, 122-124 
Fetch routine, 220, 223, 231 
Fields, microinstruction, 13 
FILE ENABLE (enable stack signal), 15 
Fixed point numbers, 191 
Floating point numbers, 191 
Flow diagram: 
computer control, 34 
computer control function, 9 
read control, 249 
FORMATTING, 13 
Fraction, signed, 191 : 
Full adder: 
basic, understanding, 93-99 
four-bit ripple-carry, 9 
truth table, 93 ° 
Full adder cells, cascaded, 93 
Full stack, 19 
Function logic, 262 


General purpose (GP) computer, 8 
General register class of instructions, 265 
Generate, carry, 94 

GP (general purpose) computer, 8 


HAL (HEX-29 Assembly Language), 264 
Half sign flag, 263 
HEX-29 Assembly Language (HAL), 264 
HEX-29 CPU, 258-315 
addressing modes, 264-265 
arithmetic operations, 266-267 
block diagram, 278, 280, 282-285 
carry-in control, 290 
condition code control, 290, 298 
condition code register, 263-264 
DMA control, 301, 305 
DMA/refresh control, 275-276 
features, 259-260 
general specifications, 263-275 
instruction matrix, 273-274 
instruction set, 265-274 
internal CPU registers, 263 
interrupt control, 301-304 
interrupt structure, 275 
macro instructions, 268-272 
microcode, 307-315 
microprogram control, 281, 287-288 
microprogram sequence branch code, 288 
microword memory, 288-290 
operating system for timesharing (HOST), 275 
shift and rotate linkage, 290 
system bus, 276-277 
system clock, 278, 281, 286 
system design goals, 259 


HEX-29 CPU (Cont.): 
system timing, 277-281 
HEX-64KBS static memory card, 306 


ICU (interrupt control units), 215, 219, 220 
Immediate addressing, 7, 265 
Immediate instruction flow chart, 337, 339 
Indexed addressing, 264, 281 
Indirect addressing, 7 
INIT microword, 122, 123 
Initializing the Am2910, 38 
Input, vector, 42-43 
Input bus, Am29018, 298, 300, 301. 
INPUT BYTE instruction description, 354 
Input instruction flow chart, 340, 341 
Input/output (see I/O) 
INPUT WORD instruction description, 354 
INSERT CHARACTER instruction description, 350 
Instruction-Based microinstruction cycle, 35 
Instruction control speed computations, 27, 30 
Instruction-data-based microinstruction cycle, 35, 
36 
Instruction Decoder, 9 
Am2903, 104 
Am2940, 242 
Am2942, 251 
Instruction descriptions, Super Sixteen, 349-355 
Instruction Enable pin, 202 
Instruction formats, 191 
HEX-29, 264 
Super Sixteen, 321 
Instruction matrix, HEX-29, 273-274 
Instruction register, 7 
Instruction set, 7 
Am29811A, 16 
HEX-29, 265-274 
Instruction types, 191-197 
memory immediate instruction, 197 
memory to memory indexed instruction, 196 
memory to memory instruction, 194 
register immediate instruction, 196 
register to indexed memory instruction, 195 
register to memory immediate instruction, 195—- 
196 
register-to-memory-reference instruction, 193-— 
194 
register-to-register (RR) instructions, 191-193 . 
register with short-immediate instruction, 194~- 
195 
Instructions, 3 
Am2940, 243 
Am2942, 251-253 
defined register class of, 265 
executed sequentially, 198 
executing, 4-5 


Instructions (Cont.): 
general register class of, 265 
Super Sixteen, 319-321 
(See also Microinstructions) 
Interface circuit for CRT controller, 62, 63 
Intermediate slice (IS), 102 
Internal high speed registers, CPU with, 3, 4 
Interprocessor interrupts, 207 
Interrupt, 206-235 
Super Sixteen, 324, 376- 377 
Interrupt acknowledge, 208 
Interrupt control, HEX-29, 301-304 
Interrupt control units (ICU), 215, 219, 220 
Interrupt Controller, 6 
Interrupt driven I/O, 239 
Interrupt example, microprogram level, 229-235 
Interrupt handling: 
computer control unit set-up for high-speed 
micro-level, 232 
sequence of events for, 207-208 
Interrupt masking, 207-208 
Interrupt microprogram, return, 220, 223, 231 
Interrupt nesting, 210 
Interrupt priority encoder, 210-211 
Interrupt recognition, 207 
Interrupt registers, HEX-29, 264 
Interrupt request: 
instruction flow during, 220, 223 
multiple, 209 
daisy chain acknowledge, 209 
single: daisy chain acknowledge, 208 
multipte poll, 208 
Interrupt request clearing, 210 
Interrupt request handling, multiple, 210 
Interrupt request masking, dynamic, 210 
Interrupt request prioritization, 210 
Interrupt request priority threshold, 211 
Interrupt Return instruction, 208 
Interrupt sequence timing, 234, 235 
Interrupt service routine, 208 
interrupt service routine inicropicarem: call, 220, 
223 
Interrupt service routine nesting, 211 
interrupt structure, 208-209 
general purpose, 210-211 
HEX-29, 275 
Interrupts: 
machine versus microprogram level, 207 
microprogram, 43 
priority schemes in, 209-210 
types of, 207 
Intraprocessor interrupts, 207 
Intrasystem interrupts, 207 
Invalid access block, 274 
Invalid instruction trap, 275 
/0 (inpuvoutput), 239 
devices, 3,6 
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VO (input/output) (Cont.): 
DMA, 240 
Super Sixteen, 324, 376-377 
write timing, 328 
IS (intermediate slice), 102 


JMP (JUMP) instruction, 13 
JUMP, UNCONDITIONAL, 14-15 
JUMP and ZERO (JZ) instruction, 19 
figure, 20 
JUMP (JMP) instruction, 13 
JUMP MAP instruction, 19 
figure, 20 
Jump map speed computations, 26, 29,30 - 
JUMP operation, 4 
JUMP-TO-ONE-OF-TWO-BRANCH-ADORESSES in- 
struction, 17 
JUMP-TO-ONE-OF-TWO-SUBROUTINES instruc- 
tion, 15, 17 
JUMP-TO-SUBROUTINE instruction, 15, 200 
Jumping: 
conditional, 13-15 
microprogram, 13 
JZ (JUMP and ZERO) instruction, 19, 20 


Last-in first-out (LIFO) stacking arrangement, 199 
Latch bypass, 220 
Latency times, 6 
Least significant slice (LSS), 102 
LIFO (last-in first-out) stacking arrangement, 199 
LOAD BYTE instruction description, 350 
LOAD COUNTER AND CONTINUE instruction, 22 
figure, 20 
LOAD instruction description, 349 
Load operations, Am2904, 132 
LOAD PROGRAM STATUS WORD instruction de- 
scription, 351 
Load select control function, 14 
LOAD STACK instructions, description of, 352 
Logic diagram: 
Am2902, 216 
Super 6ixteen CPU, 365-384 
Logic operation with shift speed computations: 
Am2901A, 109 
Am2903, 111 
Am2904, 139, 141 
Logic operation speed computations: 
Am2901A, 108 
Am2903, 110 
Am2904, 138, 140 
Logic symbol, Am2914, 212 
Logical address, 273-274 
Logical data, 191 
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Logical operations, 3 

. HEX-29 mnemonics, 267 

Lookahead adder, four-bit carry, 94-95 
LSS (least significant slice), 102: 


Machine interrupts versus microprogram level inter- 
tupts, 207 
Machine level instructions, microprogram instruc- 
tions versus, 9-10 
Machine register condition code output, Am2904, 
181 
Machine status register, Am2904, 131 
Machine status register instruction codes, Am2904, 
132, 181 
Machines, microprogrammed, 260-262 
versus non-microprogrammed, 8-9 
Macro assembler disable opcode patch, 347 
Macro instructions, HEX-29, 268-272 
(See also Instructions) 
Macro library, 347, 356-357 
Magnitude only arithmetic operation with shift down 
speed computations: 
Am2903, 110, 112 
Am2904, 140, 142 
Mapping PROM, 15, 17 
MAR (Memory Address Register), 3-6 
Mask bus, 220 
MDR (Memory Data Register), 3-6 
Memory: 
Bulk, 6 
depth-over-width ratio of, 10 
microprogram, 8, 13 
program to write into, 88 -89 
random access (see Random Access Memory) 
read access time of, 5 
Memory access: — 
direct (see Direct memory access) 
random (see Random Access-Memory) 
Memory Address Register (MAR), 3-6 
Memory addressing scheme: 
with PC in ALU, 193 
with PC outside ALU, 197 
Memory board, Super Sixteen, 362, 378 -381, 384 
Memory control, Super Sixteen, 322, 324, 374-375 
Memory Data Register (MDR),3-6 - 
Memory depth, 5 
Memory immediate instruction, 197 
Memory management, HEX-29, 273-274 
Memory management registers, HEX-29, 264 
Memory mapped 1/O, 239 
Memory mapping program address, 273-274 
Memory read timing, Super Sixteen, 363 
Memory to memory indexed instruction, 196 
Memory to memory instruction, 194 
Memory width, 5 


Memory write timing, Super Sixteen, 364 
Microcode: 
branch and stack instruction, 200 
memory immediate instruction, 197 
memory to memory indexed instruction, 196 
memory to memory instruction, 194 
register Immediate instruction, 196 
register short-immediate instruction, 195 
register to indexed memory instruction, 195 
register to memory immediate instruction, 195 * 
register to memory immediate instruction im- 
proved, 198 
register-to-memory-reference instruction, 194 
register-to-register instruction, 193 
register-to-register instruction with overlap of 
execute and PC control, 198 
return-from-subroutine instruction, 200 
Super Sixteen, 337-345 
Microcode translation, Super Sixteen, 345, 346 
Microcomputer: 
HEX-29 (see HEX-29 CPU) 
16-bit (see Super. Sixteen) 
Microinstruction control, AM2901A, 101 
Microinstruction cycle, 35-37 
Microinstruction fields, 13 
Microinstruction format, Super Sixteen, 330-337 
Microinstruction sat, Am2914, 212-213 
Microinstructions, 13 
execution of, 13 
(See also Instructions) 
Micromachine, 7,8-10 
Microprogram: 
Am2904 AMDASM Phase 1 and Phase 2 listing of, 
168-179 
AMDASM definition and assembly files, 79-87 
CRT controller, 50, 54~56 
Super.Sixteen, 340, 342-345 
Microprogram contro!, 260-261 
HEX-29, 281, 287-288 
Microprogram contro! bits, Am2903, 115, 122 
Microprogram controller, bipolar, 17 
Microprogram counter (mPC), 18 
Microprogram counter register, 15 
Microprogram execution, timing diagram of, 37 
Microprogram instructions, machine level instruc- 
tions versus, 9-10 
Microprogram interrupt, 43 
Microprogram jumping, 13 
Microprogram level interrupt: 
example of, 229-235 
versus machine interrupts, 207 
Microprogram memory, 8, 13 
Microprogram sequencer, 260-261 
Microprogram start-up flow chart, 337 
Microprogram state machine applications, 33-34 
Microprogram structure, Am2904, 144—- 145 
Microprogrammed architecture, 10 
' 


Microprogrammed design, 12-61 
key features of, 13 
Microprogrammed machines, 260-262 
Microprogramming, 259 
Microprogramming control, subroutining in, 15 
Microregister condition code output, Am2904, 181 
Microroutines, sample: 
Am2903, 122-125 
Am2904, 146-159 
Microstatus register, Am2904, 131 
Microstatus register instruction codes, Sollee 
132, 181 
Microword ADD, 123, 124 
Microword ADD DIRECT, 123-125 
Microword ADD IMMEDIATE, 123, 124 
Microword ADD RA1, 123, 125 
Microword FETCH, 122-124 
Microword INIT, 122, 123 
Microword memory, HEX-29, 288 -290 
Microword register, 14 
Mode control, 240° 
Monitor listing, Super Sixteen, 347,358-360 
Most significant slice (MSS), 102 
MOVE LONG instruction description, 353 
mPC (microprogram counter), 18 
MSS (most significant slice), 102 
Muitiplexer (MUX): 
address, 15 
condition code, 15 
data, 242, 251 
three-input, 99 
Muitiplication, two's complement, 150 
microcode, 160-161 
MULTIPLY instruction description, 350 
Multiply unsigned instruction, 147,149 - 
microcode, 160-161 
MULTIPLY UNSIGNED instruction description, 350 
Multiprocessor, interrupts in, 207 
MUX (see Multiplexer) 


N output, 102 
Negative numbers, 191 
Negative single-length number, normalized and un- 
“normalized, 147 
Nested subroutine example, 199 
Nesting: 
of interrupt service routines, 211 
of interrupts, 210 
Non-polling versus polling systems, 207 
Non-restoring binary roots, 153-154 
microcode, 162-163 
Normalization microroutine, 146-147 
Normalized negative single-length number. 147 
Normalized positive number, 146 
Numerical value of zero, 191 
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OP CODE (operation code), 7,9 
Operands, 3 
for operations, 7 » 
Operating system, System 29, disk drive manage- 
ment, 361 
Operation code (OP CODE), 7,9 
Operations, operands for, 7 
OR BYTE instruction description, 351 
OR instruction description, 349 
Output buffers, Am2903, 104 
Output bus, Am2901B, 298, 299 
OUTPUT BYTE instruction description, 354 
Output flags, 99 
Output instruction flow chart, 340, 341 
OUTPUT WORD instruction description, 354 
Overflow, 95 
Overflow detect output flag. 99 
Overflow detection signal (OVR), 93, 101, 102 
Overlap of execute and PC control, 198 
Overlapping: 
FETCH, 14-15 
Super Sixteen, 322 
OVR (overflow detection signal), 93, 101, 102 


Parallel cascade mode, 214-217 
Parallel look-ahead expansion scheme for Am2930, 
202 

PCU (see Program Control Unit) 
Phase 1 and phase 2 periods, 277-278 
Physical page zero, 273 
Pin functions, 17 
Pipeline registers, 14-15, 260-261, 290, 291 
Pipelined microprogram bits, Am2904, 144-146 
Pipelined operations, Super Sixteen, 329-330 
Pipelining: 

FETCH, 14-15 

Super Sixteen, 322 
Polling. 6 

versus non-polting systems, 207 
POP instruction description, 352 
POP operation, 18 
Positive numbers, 191 

normalized and unnormalized, 146 
P/POP instruction description, 352 
P/PUSH instruction description. 352 
Priority schemas in interrupt, 209-210 
Program, 3 

to write into character memory, 88-89 
Program control. HEX-29 mnemonics, 267 
Program Control Unit (PCU), 4-6, 199-204 

Am2901A as, 203, 204 

counter-type, 4 

Super Sixteen, 321, 368-369 
Program control unit performance. improving, 

197-203 
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Program steps, 4 
Programmed I/O, 239 
Programming, 7 
Am2914, 212-213 
PROM.8 
mapping, 15, 17 
Propagate, carry, 94 
Propagation delay calculations: 
on Am2910 microprogram sequencer, 24-<, 
for Am2911 and Am29811A design, 24, 28-31 
Propagation delays, combinational: 
Am2901A, 126 
Am2903, 127 
PUP (push/pop control). 15 
PUSH/CONDITIONAL LOAD COUNTER instruction, 
19-21 
figure, 20 
PUSH instruction description, 352 
PUSH operation, 18 
Push/pop control (PUP), 15 


Q input, 101 
Q register. 99 
Am2903, 103-104 


( input field, 99 
Random Access Memory (RAM), 8, 99 
address cascading, Am2903, 106 
shift network, 99 
two-port, 102 
write enable (RAM EN), 99 
Read access time of memory. 5 
Read control flow chart, 249 
Register immediate instruction, 196 
Register operations, Am2904, 132 
Register with short-immediate instruction, 194-195 
Register to indexed memory (RX) instructions, 195 
addressing, 7 
flow chart of, 337, 338 
sequence of, Super Sixteen 328 
Register to memory immediate instruction, 195-196 
Register-to-memory-reference instruction, 193-194 
Register-to-register (RR) instructions, 191-193 
addressing, 7 
flow chart of 337, 338 
sequence of, Super Sixteen, 328 
Registers: 
Am2914, 213-214 
pipeline. 14-15, 260-261, 290. 291 
Working. 3 
Pelative addressing 4-5 
Rehaoddty system, 277 
Remainder, truc varue of. 157 


REPEAT LOOP, COUNTER ¥ ZERO instruction, 21 
figure, 20 
REPEAT PIPELINE REGISTER, COUNTER # ZERO, 
21 } 
figure, 20 
Repetitive DMA, 240 
RESET instruction, 18, 19 
figure, 20 
Restore after interrupt service routine, 208 
Return-from-interrupt sequence timing, 234, 235 
Return-from-subroutine command, 198 
RETURN-FROM-SUBROUTINE instruction, 15, 21 
figure, 20 
Return-from-subroutine instruction microcode, 200 
RETURN instruction description, 352 
Return interrupt microprogram, 220, 223, 231 
Return register, 198 
Ripple carry method usages, 95 
Ripple cascade mode, 214-217 
Ripple expansion scheme for Am2930, 202 
Ripple propagation time, 93 
ROTATE LEFT instruction description, 354 
ROTATE RIGHT instructions, description of, 354 
Rotating structure interrupt scheme, 209-210 
RR instructions | see Register-to-register (RR) in- 
structions} 
RX instructions [see Register to indexed memory 
(RX) instructions} 


$ (sum output), 93 
S input field, 99 
S/29 (see System/29) 
Save status, 207 
Schottky speeds, standard device, 107 
Am2904, 135 
SET, CLR, COMPLEMENT, TEST BIT PSW instruc- 
tion description, 352 
Set-up and hold times: 
Am2901A, 126 
Am2903, 127 
Shift and rotate instruction flow chart, 339, 
341 
Shift-down operations, 115 
SHIFT LEFT instructions, description of, 354 
Shift linkage multiplexer instruction codes, 
Am2904, 133 
Shift linkage multiplexers, Am2904, 131-132 
Shift network, RAM, 99 
SHIFT RIGHT instructions, description of, 353 
Shift-up operations, 115 
Shifter, 3,4 
Sign bit output flag, 99 
Signal delay path, longest, 23-24 
Singie-length normalize command. 146-148 
microcode, 162-163 


Slice: ; 
intermediate (IS), 102 
least significant (LSS), 102 
most significant (MSS), 102 
Software, 7 
Software emulation of CRT controller, 66-77 . 
SP (stack pointer), 4, 15, 18 
SSI/MS1, 135, 143 
Stack, full, 19 
Stack addressing. 265 
Stack and link, 198 
Stack pointer (SP), 4, 15 
built-in, 18 
Standard device Schottky speeds, 107 
Am2904, 135 
Static structure interrupt scheme, 209 
Status bus, 220 
Status registers, Am2904, 131 
STORE BYTE instruction description, 350 
STORE CHARACTER instruction description, 350 
STORE instruction description, 349 
STORE PROGRAM STATUS WORD instruction de- 
scription, 351 
STORE STACK instructions, description of, 352 
Stored-program computer. 3-7 
Subroutine example, nested. 199 
Subroutine stack architecture, 200 
Subroutining, 198-202 
in microprogramming control, 15 
SUBTRACT instruction description, 349 
SUBTRACT WITH CARRY instruction description, 
349 
Sum output (S), 93 
Super Sixteen, 318-384 
ALU, 321-323, 366-367 
central processing unit architecture, 321-330 
logic diagrams, 365-384 
central processing unit read timing, 325 
clock and memory control, 322, 324, 374-375 
computer control unit, 322 
data path, 324, 370-371 
engineering model, 346-348 
instruction descriptions, 349-355 
instruction format, 321 
instructions, 319-321 
interrupt and I/O, 324,376-377 
I/O write timing, 328 
macro library, 347, 356-357 
memory board, 362, 378-381, 384 
memory read timing, 363 
memory write timing, 364 
microcode, 337-345 
microcode translation, 345, 346 
microinstruction format, 330-337 * 
microprogram, 340, 342-345 
monitor listing, 347, 358-360 
pipelined operations, 329-330 
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Super Sixteen (Cont.): 
RR instruction sequence, 328 
RX instruction sequence, 328 
S/29 WCS interface. 382-384 
system organization, 319 
SUPERVISOR CALL instruction description, 
352 
Switching characteristics: 
Am2901A, 125, 126 
Am2903, 125, 127 
preliminary Am2904, 135 
Sync control logic, 232, 233 
System reliability, 277 
System/29, 10, 345, 347-348 
operating system disk drive management, 361 
universal card, 88 
WCS interface, Super Sixteen, 382-384 


TEST END-OF-LOOP instruction, 22 
figure, 20 
TEST IMMEDIATE instruction description, 350 
THREE-WAY BRANCH instruction, 22 
figure, 20 
Time delay using Am2914, 220, 224-22 
Timing: . 
Am2940, 243-247 
bit slice, 106-113 
CCU, 22-33 
HEX-29 system, 277-281 
interrupt sequence, 234, 235 
return-from-interrupt sequence, 234. 235 
Timing analysis, Am2904, 135 
Timing analysis summary, 144 
Timing considerations, CRT controller, 50, 57-61 
Timing diagram of microprogram execution, 37 
Transfer complete circuitry: 
Am2940, 242 
Am2942, 251 
TRANSLATE AND TEST instruction description, 
353 
TRANSLATE instruction description, 352 
Trap, 275 
Truth table, full adder, 93 
Two-port RAM, 102 
Two's complement arithmetic operation with shift 
down and 16-bit speed computations, Am2903, 
112 
Two's complement arithmetic operation with shift 
down speed computations: 
Am2901A, 109 
Am2904, 139 
Two's complement division, 150-153 
microcode, 160-161 
Two's complement multiplication, 150 
microcode, 160-161 
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Unconditional branch flow chart, 339 
UNCONDITIONAL JUMP, 14-15 
Unconditional operation. 4 
Unnormalized negative single-length number, 147 
Unnormalized positive number, 146 
Unsigned multiply, 147, 149 

microcode, 160-161 


Vector bus, 220 
Vector generator, 232, 233 
Vector input, 42-43 


Width, memory, 5 
Word count compare mode: 
Am2940, 242 
Am2942, 251 
Word count equals zero mode, Am2942, 251 
Word count maintenance, 240 
Word count register: 
Am2940, 242 
Am2942, 251 
Word counter: 
Am2940, 242 
Am2942, 251 


Word counter carry out mode: 
Am2940, 243 
Am2942, 251 
Word lengths, 191 
Words, 5 
16-bit, 191 
Working Registers, 3 
Write-protect bit (WP), 273 


XOR BYTE instruction description, 351 
XOR instruction description, 350 


Y outputinstruction codes, Am2904, 135 
Y outputs, 101 
three-state, 18 


Zero, numerical value of, 191 
Zero detect output flag, 99 
ZERO instruction, 43 
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